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Novel cell cycle genes and uses thereof 



The present invention relates to DNA sequences encoding cell cycle interacting proteins 
as well as to methods for obtaining the same. The present invention also provides 
vectors comprising said DNA sequences, wherein the DNA sequences are operatively 
linked to regulatory elements allowing expression in prokaryotic and/or eukaryotic host 
cells. In addition, the present invention relates to the proteins encoded by said DNA 
sequences, antibodies to said proteins and methods for their production. Furthermore, 
the present invention relates to regulatory sequences which naturally regulate the 
expression of the above described DNA sequences. The present invention also relates 
to a method for controlling or altering growth characteristics of a plant and/or a plant cell 
comprising introduction and/or expression of one or more cell cycle regulatory proteins 
functional in a plant or parts thereof and/or one or more DNA sequences encoding such 
proteins. Also provided by the present invention is a process for disruption plant cell 
division by interfering in the expression of a substrate for cyclin-dependent protein 
kinase using a DNA sequence according to the invention wherein said plant cell is part 
of a transgenic plant. The present invention further relates to diagnostic compositions 

present 

invention also relates to methods for the identification of compounds being capable of 
activating or inhibiting the cell cycle. Furthermore, the present invention relates to 
transgenic plant cells, plant tissue and plants containing the above-described DNA 
sequences, regulatory sequences and vectors as well as to the use of the 
aforementioned DNA sequences, regulatory sequences, vectors, proteins, antibodies 
and/or compounds identified by the method of the invention in plant cell and tissue 
culture, plant breeding and/or agriculture. 
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Several documents are cited throughout the text of this specification. Each of the 
documents cited herein (including any manufacturer's sp cifications. instructions, etc.) 
are hereby incorporated herein by reference; however^here is no admission that any 
document cited is indeed prior art as to the present invention. 

Cell division is fundamental for growth in humaas. animals and plants. Prior to dividing 

in two daughter cells, the mother cell needs to replicate its DNA. The cell cycle is 

traditionally divided into 4 distinct phases: 

G1 : the gap between mitosis and the onset of DNA synthesis; 

S : the phase of DNA synthesis: 

G2: the gap between S and mitosis; 

M : mitosis, the process of nuclear division leading up to the actual cell division. 
The distinction of these 4 phases provides a convenient way of dividing the interval 
between successive divisions. Although they^ have served .a useful-purpose, a recent 
flurry of experimental resiultst much of it as a consequence of cancer research, has 
resulted in a more intricate^picture of the cell. cycle's "four seasons" (Nasmyth, Science 
274. 1643-1645. 1996; Nurse, Nature, 344, 503-508* 1 990):"The underiying mechanism 
controlling the cell cycle control system has only recently been studied in greater detail. 
In all eukaryotic systems, including plants, this control mechanism is based on two key 
families of proteins which regulate the essential process of cell division, namely protein 
L-:r.oo<^c /r-wHm Hon<3nH«>nt kin»RA<; or CDKs^ and their activatino associated subunits, 
called cyclins. The activity of these protein complexes is switched on and off at specific 
points of the cell cycle. Particular CDK-cyclin complexes activated at the G1/S transition 
trigger the start of DNA replication. Different CDK-cyclin complexes are activated at the 
G2/M transition and induce mitosis leading to cell division. Each of the CDK-cyclin 
complexes execute their regulatory role via modulating different sets of multiple target 
proteins. Furthermore, the large variety of developmental and environmental signals 
affecting cell division all converge on the regulation of CDK activity. CDKs can therefore 
be seen as the central engine driving cell division. 

In animal systems and in yeast, knowledge about cell cycle regulations is now quite 
advanced. The activity of CDK-cyclin complexes is regulated at five levels: (i) 




transcription of the CDK and cyclin genes; (ii) association of specific CDK's with their 
specific cyclin partner, (iii) phosphorylation/dephosphorylation of the CDK and cyclins; 
(iv) interaction with other regulatory proteins such as SUC1/CKS1 homologues and cell 
cycle kinase inhibitors (CKI); and (v) cell cycle phase-dependent destmction of the 
cyclins and CKIs. 

The study of cell cycle regulation in plants has lagged behind that in animals and yeast. 
Some basic mechanisms of cell cycle control appear to be conserved among 
eukaryotes, including plants. Plants were shown to also possess CDK's, cyclins and 
CKI's. However plants have unique developmental features which are reflected in 
specific characteristics of the cell cycle control. These include for instance the absence 
of cell migration, the fomnation of organs throughout the entire lifespan from specialized 
regions called meristems, the fomnation of a cell wall and the capacity of non-dividing 
cells to re-enter the cell cycle. Another specific feature is that many plant cells, in 
particular those involved in storage (e.g. endospenn), are polyploid due to rounds of 
DNA synthesis without mitosis. This so-called endoreduplication is intimately related 
with cell cycle control. 

Due to these fundamental differences, multiple components of the cell cycle of plants 
are unique compared to their yeast and animal counterparts. For example, plants 
contain a unique class of CDKs, such as CDC2b in Arabidopsis, which are both 
structurally and functionally different from animal and yeast CDKs. 
The further elucidation of cell cycle regulation in plants and its differences and 
similarities with other eukaryotic systems is a major research challenge. Strictly for the 
case of comparison, some key elements about yeast and animal systems are described 
below in more detail. 

As already mentioned above, the control of cell cycle progression in eukaryotes is 
mainly exerted at two transition points: one in late G,, before DNA synthesis, and one at 
the GJM boundary. Progression through these control points is mediated by cyclin- 
dependent protein kinase (CDK) complexes, which contain, in more detail, a catalytic 
subunit of approximately 34-kDa encoded by the CDK genes. Both Saccharomyces 
cerevisiae and Schizosaccharomyces pombe only utilize one CDK gene for the 
regulation of their cell cycle. The kinase activity of their gene products p34^^^ and 



p34CDC28 jp 3c/). pombe and in S. cerevisiae, respectively, is dependent on regulatory 
proteins, called cyclins. Progression through the different cell cycle phases is achieved 
by the sequential association of p34°''^"" with different cyclins. Although in high r 
eukaryotes this regulation mechanism is conserved, the situation is more complex since 
they have evolved to use multiple CDKs to regulate the different stages of the cell cycle. 
In mammals, seven CDKs have been described, defined as CDK1 to CDK7. each 
binding a specific subset of cyclins. 

In animal systems. CDK activity is not only regulated by its association with cyclins but 
also involves both stimulatory and inhibitory phosphorylations. Kinase activity Is 
positively regulated by phosphorylation of a Thr residue located between amino acids 
160-170 (depending on the CDK protein). This phosphorylation is mediated by the CDK- 
activating kinase (CAK) which interestingly is a CDK/cyclin complex itself. Inhibitory 
phosphorylations occur at the ATP-binding site (the Tyri 5 residue together with Thr14 in 
higher eukaryotes) and are carried out by at least two protein kinases. A specific 
phosphatase. CDC25. dephosphorylates these residues at the GJM checkpoint, thus 
activating CDK activity and resulting in the onset of mitosis. CDK activity is furthermore 
negatively regulated by a family of mainly low-molecular weight proteins, called cyclin- 
dependent kinase inhibitors (CKIs). Kinase activity is inhibited by the tight association of 
these CKIs with the CDK/cyclin complexes. 

With respect to cell cycle regulation in plants a summary of the state of the art is given 
below. In Arabidopsis, thusfar only two UUI^ genes nave oeen isoiaieu. oL/o^«>^t -na 
CDC2bAt. of which the gene products share 56% amino acid identity. Both CDKs are 
distinguished by several features. First, only CDC2aAt is able to complement yeast 
P34COC2/CDC28 mutants. So^cond. CDC2aAt and CDC2bAt bear different cyclin-binding 
motifs (PSTAIRE and PPTALRE, respectively), suggesting they may bind distinct types 
of cyclins. Third, although both CDC2aAt and CDC2bAt show the same spatial 
expression pattern, they exhibit a different cell cycle phase-specific regulation. The 
CDC2aAt gene is expressed constitutively throughout the whole cell cycle. In contrast. 
CDC2bAt mRNA levels oscillate, being most abundant during the S and G2 phases. In 
addition, multiple cyclins have been isolated from Arabidopsis. The majority displays the 
strongest sequence similarity with the animal A- or B-type class of cyclins. but also D- 



fprlnSsi^p? 




type cyclins have been identified. Although the classification of Arabidopsis cyclins is 
mainly based upon sequence similarity, limited data suggests that this organization 
con-esponds with differential functions of each cyciin class. Direct binding of any cyclin 
with an Arabidopsis CDK subunit has, however, not yet been demonstrated. 
In order to manage problems related to plant growth, plant architecture and/or plant 
diseases, it is believed to be of utmost importance to identify and isolate plant genes 
and gene products involved in the regulation of the plant cell division, and more 
particularly coding for and interacting with CDK's and/or their interacting proteins, 
responsible for the control of the cell cycle and the completion of the S and M phase of 
the cell cycle. If such novel genes and/or proteins have been isolated and analyzed, the 
growth of the plant as a whole can be influenced. Also, the growth of specific tissues or 
organs and thus the architecture of the plant can be modified. 

Thus, the technical problem underlying the present invention is to provide means and 
methods for modulating cell cycle proteins that are particular useful in agriculture and 
plant cell and tissue cultLre. 

The solution to the technical problem is achieved by providing the embodiments 
characterized in the claims. 

Accordingly, the invention relates to a DNA sequence encoding a cell cycle interacting 
protein or encoding an immunologically active and/or functional fragment of such a 
protein, selected from the group consisting of: 
(a) DNA sequences 

(aa) comprising a nucleotide sequence encoding at least the mature form of a 
protein (LDV115) comprising the amino acid sequence as given in SEQ ID 
NO: 2; 

(ab) comprising the nucleotide sequence as given in SEQ ID NO: 1; 

(ac) comprising a nucleotide sequence hybridizing with the complementary 
strand of a nucleotide sequence as defined in (aa) or (ab) under stringent 
hybridization conditions; 



(ad) comprising an nucleotide sequence encoding a protein having an amino 
acid sequence at least 60 % identical to the amino acid sequence encoded 
by the nucleotide sequence of (aa) or (ab); 

(ae) comprising a nucleotide sequence encoding at least the domain binding to 
CDKs of the protein encoded by the nucleotide sequence of any one of 
(aa) to (ad); 

(b) DNA sequences 

(ba) comprising a nucleotide sequence encoding at least the mature form of a 
protein (LDV24) comprising the amino acid sequence as given in SEQ ID 
NO: 4; 

(bb) comprising the nucleotide sequence as given in SEQ ID NO: 3; 

(be) comprising a nucleotide sequence hybridizing with the complementary 
strand of a nucleotide sequence as defined in (ba) or (bb) under stringent 
hybridization conditions; 

(bd) comprising an nucleotide sequence encoding a protein having an amino 
acid sequence at least 60 % identical to the amino acid sequence encoded 
by the nucleotide sequence of (ba) or (bb); 

(be) comprising a nucleotide sequence encoding at least the cyclin-like 
interacting domain of the protein encoded by the nucleotide sequence of 
any one of (ba) to (bd); 

(c) DNA sequences 

(ca) comprising a nucleotide sequence encoding at least the mature form of a 
protein (VB33) comprising the amino acid sequence as given in SEQ ID 
NO: 6; 

(cb) comprising the nucleotide sequence as given in SEQ ID NO: 5; 

(cc) comprising a nucleotide sequence hybridizing with the complementary 
strand of a nucleotide sequence as defined in (ca) or (cb) under stringent 
hybridization conditions; 

(cd) comprising an nucleotide sequence encoding a protein having an amino 
acid sequence at least 60 % identical to the amino acid sequence encoded 
by the nucleotide sequence of (ca) or (cb); 




(ce) comprising a nucleotide sequenc encoding at least the domain binding to 
CDKs of the protein encoded by the nucleotide sequence of any one of 
(ca) to (cd); 

(d) DNA sequences 

(da) comprising a nucleotide sequence encoding at least the mature form of a 
protein (VB89) comprising the amino acid sequence as given in SEQ ID 
NO: 8; 

(db) comprising the nucleotide sequence as given in SEQ ID NO: 7; 

(dc) comprising a nucleotide sequence hybridizing with the complementary 
strand of a nucleotide sequence as defined in (da) or (db) under stringent 
hybridization conditions; 

(dd) comprising an nucleotide sequence encoding a protein having an amino 
acid sequence at least 60 % identical to the amino acid sequence encoded 
by the nucleotide sequence of (da) or (db); 

(de) comprising a nucleotide sequence encoding at least the domain binding to 
CDKs of the protein encoded by the nucleotide sequence of any one of 
(da) to (dd); 

(e) DNA sequences 

(ea) comprising a nucleotide sequence encoding at least the mature form of a 
protein (VBSHF) comprising the amino acid sequence as given in SEQ ID 
NO: 10; 

(eb) comprising the nucleotide sequence as given in SEQ ID NO: 9; 

(ec) comprising a nucleotide sequence hybridizing with the complementary 
strand of a nucleotide sequence as defined in (ea) or (eb) under stringent 
hybridization conditions; 

(ed) comprising an nucleotide sequence encoding a protein having an amino 
acid sequence at least 60 % identical to the amino acid sequence encoded 
by the nucleotide sequence of (ea) or (eb); 

(ee) comprising a nucleotide sequence encoding at least the domain binding to 
CDKs of the protein encoded by the nucleotide sequence of any one of 
(ea) to (ed); 
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(f) DNA sequences ^ 

(fa) comprising a nucleotide sequence encoding at least the mature forni of a 
protein (VBDPBP) comprising the amino acid sequence as given in SEQ 
ID NO: 12; 

(fb) comprising ihe nucleotide sequence as given in SEQ ID NO: 1 1 ; 

(fc) comprising a nucleotide sequence hybridizing with the .complementary 
strand of a nucleotide sequence as defined in (fa) or (fb) under stringent 
hybridization conditions; 

(fd) comprising an nucleotide sequence encoding a protein having an amino 
acid sequence at least 60 % identical to the amino acid sequence encoded 
by the nucleotide sequence of (fa) or (fb); 

(fe) comprising a nucleotide sequence encoding at least the domain binding to 
CDKs of the protein encoded by the nucleotide sequence of any one of 
{fa)to.(fd)w. 

(g) DNA sequences^ 

(ga) comprising*^ nucleotide^sequence encoding at least thie mature form of a 
protein (VBDAMP) comprising the amino acid sequence as given in SEQ 
ID N®ri4;^ 

(gb) comprising the nucleotide sequence as given in SEQ ID NO: 13; 

(gc) comprising a nucleotide sequence hybridizing with the complementary 
strand of a nucleotide sequence as defined in (ga) or (gb) under stringent 
hybridization conditions; 

(gd) conriprisihg an nucleotide seq^^ encoding a protein having an amino 
acid sequence at least 60 % identical to the amino acid sequence encoded 
by the nucleotide sequence of (ga) or (gb); 

(ge) comprising a nucleotide sequence encoding at least the domain binding to 
CDKs of the protein encoded by the nucleotide sequence of any one of 
(ga) to (gd); 

(h) DNA sequences, the nucleotide sequence of which is degenerate as a result of 
the genetic code to a nucleotide sequence of a DNA sequence as defined in any 
one of (a) to (g); and 



(i) DNA sequences comprising a nucleotide sequence encoding a fragment of a 
protein encoded by a DNA sequence of any one of (a) to (h), wherein said 
fragment is capable of interacting with a cell cycle protein. 

The term "cell cycle interacting protein" as denoted herein means a protein capable of 
binding to cyclin dependent kinases, in particular to CDC2a and/or CDC2b and 
preferably to plant cyclin dependent kinases or their subunits. 

The term "cell cycle" means the cyclic biochemical and structural events associated with 
growth and proliferation of cells, and in particular with the regulation of the replication of 
DNA and mitosis. The cycle is divided into periods called: Gq, Gap^ (G-,), DNA synthesis 
(S), Gap2 (G2), and mitosis (M). 

The term "proliferation" means growth and reproduction, i.e. division of cells. 
The term "cell division" means mitosis, i.e. the usual process of cell reproduction. 
The terms "gene", "polynucleotide", "nucleic acid sequence", "nucleotide sequence", 
"DNA sequence" or "nucleic acid molecule" as used herein refers to a polymeric form of 
nucleotides of any length, either ribonucleotides or deoxy ribonucleotides. This term 
refers only to the primary structure of the molecule. Thus, this tenm includes double- and 
single-stranded DNA, and RNA. It also includes known types of modifications, for 
example, methylation, "caps" substitution of one or more of the naturally occuring 
nucleotides with an analog. Preferably, the DNA sequence of the invention comprises a 
coding sequence encoding at least the mature form of the above defined cell cycl 
interacting protein, i.e. the protein which is posttranslationally processed in its 
biologically active form, for example due to cleavage of leader or secretory sequences 
or a proprotein sequence or other natural proteolytic cleavage points. 
By "functional fragment" and "biologically active form" polypeptides are meant that 
exhibit activity similar, but not necessarily identical, to an activity of the wild-type cell 
cycle interacting proteins of the invention or an activity that is enhanced over that of the 
wild-type proteins (either the full-length protein or, preferably, the mature protein), as 
measured in a particular biological assay. Assays of cell cycle interacting activity are 
disclosed, for example, in Example 1. below. These assays can be used to measure cell 
cycle interacting activity of partially purified or purified native or recombinant protein. 
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The cell cycle interacting protein of the Invention binds to CDC2. i.e. CDC2a and/or 
CDC2b, e.g.. from Arabidopsls. Thus, a polypeptide having a functional fragment or the 
"biological activity" of the cell cycle interacting protein of the invention will bind to CDCs 
as set forth In Example 1 . 

The term "immunologically active fragment" of a cell cycle interacting protein of the 
invention denotes proteins or peptides which have at least. a part of the primary 
structural conformation for one or more epitopes capable of reacting specifically with 
antibodies to a protein which is encodable by a nucleic acid molecule as set forth above. 
Preferably, the peptides and proteins encoded by a nucleic acid molecule of the 
invention are recognized by an antibody that specifically recognizes an epitope of th 
cell cycle interacting protein comprising the amino acid residues that are unique for th 
protein encoded by any one of the aforementioned DNA sequences. Preferably, said 
peptides and proteins are capable of eliciting an effective immune response in a 
mammal, for example mouse-or rabbit?* 

The DNA sequence which encodes for the predicted mature polypeptides of the proteins 
comprising SEQ ID NOS: 2. 4. 6. 8, 10. 12 or 14 or for the biologically active fragment 
thereof may Include: only the coding sequence for the mature polypeptide or for a 
biologically active fragment thereof; the coding seuence for the mature polypeptide or 
for a biologically active fragment thereof and additional coding sequence such as a 
leader or secretory sequence or a proprotein sequence; the coding sequence for the 
mature polypeptide (and optionally additional coding sequence) and non-coding 
sequence, such as intron or non-coding sequence 5' and/or 3* of the coding sequence 
forthe predicted mature polypeptide. 

A "coding sequence" is a nucleotide sequence which is transcribed into mRNA and/or 
translated into a polypeptide when placed under the control of appropriate regulatory 
sequences. The boundaries of the coding sequence are determined by a translation 
start codon at the 5'-terminus and a translation stop codon at the 3'-terminus. A coding 
sequence can include, but is not limited to mRNA. cDNA. recombinant nucleotide 
sequences or genomic DNA, while introns may be present as well under certain 
circumstances. Thus, the nucleotide sequences of the present invention can be 
engineered in order to alter a cell cycle interacting protein coding sequence for a variety 



of reasons, including but not limited to, alterations which nnodify the cloning, processing 
and/or expression of the gene product. For example, mutations may be introduced using 
techniques which are well known in the art, e.g., site-directed mutagenesis to insert new 
restriction sites, to alter glycosylation patterns, to change codon preference, to produce 
splice variants, etc. 

In accordance with the present invention a two-hybrid system (Fields et al., Nature 340 
(1989). 245-246.) was exploited whereby CDC2aAt or CDC2bAt as bait and a cDNA 
library of a cell suspension as prey are used. Novel gene products interacting with 
CDC2aAt or CDC2bAt indicative of hitherto unknown plant cell cycle regulatory 
nucleotide sequences were identified. The library was made from a mixture mRNA from 
Arabidopsis thaliana cell suspensions harvested at various growing stages: eariy 
exponential, exponential, eariy stationary and stationary phase. 

Nine cDNA clones have been identified in accordance with the invention comprising the 
nucleotide sequences as depicted in SEQ ID NOS: 1, 3, 5, 7, 9, 11 and 13, which 
encode proteins that are capable of specifically interacting with cdc2aAt or cdc2bAt; see 
Examples 1 and 2, below. The proteins encoded by the cDNA clones comprised the 
amino acid sequences depicted in SEQ ID NOS: 2, 4, 6, 8, 10, 12 and 14. Computer 
assisted homology search in genome data bases revealed that novel genes have been 
identified and/or genes where the (partial) cDNA was described but the particular 
function of the gene remained unknown. In particular, the examples of the present 
invention demonstrate that novel cell cycle interacting proteins and their encoding genes 
have been identified. The possible applications of the these cell cycle interacting 
proteins and their encoding nucleic acid molecules will be discussed further below and 
are evident from the description provided in Example 3, 

The homology search was performed with the program BLASTX and BLASTN (version 
2.0a19MP-WashU [build decunix3.2 01:53:29 05-feb-1998] (see Altschul, Nucleic Acids 
Res, 25 (1997), 3389-3402) on the Arabidopsis thaliana nucleic acids database at 
ATDB at Stanford (http://genome-www2.stanford.edu/cgi-bin/AtDB/nph-blast2atdb). The 
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function GAP (general alignment ) (from the GCG 9.1 package. Genetics Computer 
Group Inc., Madison, USA) has been used with the parameters Gap weight =12 and 
Length weight = 4 to quantify the percentage of homology and similarity. The protein 
sequences were then used to perform a BLASTP (version 2.0.4 [feb-24-1998]) with 
BEAUTV post-processing provided by the Human Genome Center, Baylor College of 
Medicine against the National Center for Biotechnology Information's non-redundant 
protein database (http://dotjmgen.bcm.tmc.edu:9331/seq-search/protein-search.html). 
The results of the homology search are described in the appended examples. 

Thus it is evident that the genes comprising the nucleotide sequences of each SEQ ID 
NOS, 1, 3, 5, 7, 9, 11 and 13 each encode a member of a novel class of cell cycle 
interacting proteins. 

The present invention also relates to DNA sequences hybridizing with the above- 
described DNA sequences and differ in one or more positions in comparison with these 
as long as they encode a cell cycle interacting protein. By "hybridizing" it is meant that 
such nucleic acid molecules hybridize under conventional hybridization conditions, 
preferably under stringent conditions such as described by, e.g., Sambrook (Molecular 
Cloning; A Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY (1989)). An example of one such stringent hybridization condition is 

hwhriHi-^atinn at ilY<s,QO af f^fy fr\[\r\\AiGfi Kw a \A/ocKinn in H 1 VQQ^ ad ^f^w r^ry^ 

hour. Altematively, an exemplary stringent hybridization condition Is in 50 % formamide, 
4XSSC at 42 **C: Cell cycle interacting proteins derived from other organisms such as 
mammals, in particular humans, may be encoded by other DNA sequences which 
hybridize to the sequences for plant cell cycle interacting proteins under relaxed 
hybridization conditions and which code on expression for peptides having the ability to 
interact with cell cycle proteins. Examples of such non-stringent hybridization conditions 
are 4XSSC at 50 **C or hybridization with 30-40 % formamide at 42 °C. Such molecules 
comprise those which are fragments, analogues or derivatives of the cell cycle 
interacting protein of the invention and differ, for example, by way of amino acid and/or 
nucleotide deletion(s), insertion(s), substitution(s), addition(s) and/or recombination(s) or 
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any other modification(s) known in the art either alone or in combination fronn the above- 
described amino acid sequences or their underlying nucleotide sequence(s). Using the 
Pestfind program (Rogers, Science 234 (1986), 364-368), PEST sequences (rich in 
proline, glutamic acid, serine, and threonine) can be identified, which are characteristically 
present in unstable proteins. Such sequences may be removed from the cell cycle 
interacting proteins in order to increase the stability and optionally the activity of the 
proteins. Methods for introducing such modifications in the nucleic acid molecules 
according to the invention are well-known to the person skilled In the art. The invention 
also relates to nucleic acid molecules the sequence of which differs from the nucleotid 
sequence of any of the above-described nucleic acid molecules due to the degeneracy 
of the genetic code. All such fragments, analogues and derivatives of the protein of th 
invention are included within the scope of the present invention, as long as the essential 
characteristic immunological and/or biological properties as defined above remain 
unaffected in kind, that is the novel nucleic acid molecules of the invention include all 
nucleotide sequences encoding proteins or peptides which have at least a part of the 
primary structural conformation for one or more epitopes capable of reacting with 
antibodies to cell cycle interacting proteins which are encodable by a nucleic acid 
molecule as set forth above and which have comparable or identical characteristics in 
terms of biological activity and/or the capability to interact with other proteins. It is 
preferred that proteins encoded by a nucleic acid molecule of the Invention are at least 
capable of interacting with CDC2, particulariy CDC2a and/or CDC2b, preferably from a 
plant such as Arabidopsis thaliana. Whilst the above described proteins may interact 
with^ CDC2~with CD nnost likeilylrtte with a 

CDC2 from the same species from which the gene was isolated (homologous 
interaction). This capability allows advantageous uses of the proteins of the invention 
and their encoding nucleic acid molecules as will be described in more detail below. 
Part of the invention is therefore also nucleic acid molecules encoding a polypeptide 
comprising at least a functional part of a cell cycle interacting protein encoded by a 
nucleic acid sequence comprised in a nucleic acid molecule according to the invention. 
An example for this is that the polypeptide or a fragment thereof according to the 
invention is embedded in another amino acid sequence. Preferably, the DNA sequence 
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of the invention encxxJes a protein having substantially the same amino acid sequence 
as the proteins defined in SEQ ID NOS 2, 4, 6, 8. 10, 12 and 14. 

Extending the polynucleotide sequence of the invention 

The polynucleotide sequences encoding the cell cycle interacting proteins may be 
extended utilizing partial nucleotide sequence and various methods known in the art to 
detect upstream sequences such as promoters and regulatory elements. Gobinda, 
(PGR Methods Applic. 2 (1993), 318-322) discloses "restriction-site" polymerase chain 
reaction (PGR) as a direct method which uses universal primers to retrieve unknown 
sequence adjacent to a known locus. First, genomic DNA is amplified in the presence of 
primer to a linker sequence and a primer specific to the known region. The amplified 
sequences are subjected to a second round of PGR with the same linker primer and 
another specific primer internal to the first one. Products of each round of PGR are 
transcribed with an appropriate RNA polymerase and sequenced using reverse 
transcriptase. 

Inverse PGR can be used to amplify or extend sequences using divergent primers 
based on a known region (Trigiia, Nucleic Acids Res. 16 (1988), 8186). The primers 
may be designed using OLIGO® 4.06 Primer Analysis Software (1992; National 
Biosciences Inc, Plymouth MN), or another appropriate program to be preferably 22-30 
nucleotides in length, to have a GG content of preferably 50% or more, and to anneal to 
the target sequence at temperatures preferably about 68''-72**G. The method uses 
several restriction enzymes to generate a suitable fragment in the known region of a 
gene. The fragment is then circularized by intramolecular ligation and used as a PGR 
template. 

Gapture PGR (Lagerstrom, PGR Methods Applic. 1 (1991), 111-119) is a method for 
PGR amplification of DNA fragments adjacent to a known sequence in, e.g., human or 
plant yeast artificial chromosome DNA. Gapture PGR also requires multiple restriction 
enzyme digestions and ligations to place an engineered double-stranded sequence into 
an unknown portion of the DNA molecule before PGR. 




Another method which may be used to retrieve unknown sequences is that of Parker, 
(Nucleic Acids Res. 19 (1991), 3055-3060). Additionally, one can use PGR, nested 
primers and PromoterFinder libraries to walk in genomic DNA (PromoterFinder™ 
Clontech (Palo Alto CA). This process avoids the need to screen libraries and is useful 
In finding intron/exon junctions. Preferred libraries for screening for full length cDNAs 
are ones that have been size-selected to include larger cDNAs. Also, random primed 
libraries are preferred in that they will contain more sequences which contain the 5* and 
upstream regions of genes. A randomly primed library may be particularty useful if an 
oligo d(T) library does not yield a full-length cDNA. Genomic libraries are useful for 
extension into the 5' nontranslated regulatory region. Capillary electrophoresis may be 
used to analyze the size or confirm the nucleotide sequence of sequencing or PGR 
products; see, e.g., Sambrook, supra. Systems for rapid sequencing are available from 
Perkin Elmer, Beckmann Instruments (Fullerton GA), and other companies. 

Computer-assisted identification of ceil cycle interacting proteins and their 
encoding genes 

As is further described in the appended examples BLAST2, which stands for Basic 
Local Alignment Search Tool (Altschul, 1997; Altschul. J. Mol. Evol. 36 (1993), 290-300; 
Aitschul, J. Mol. Biol. 215 (1990), 403-410), can be used to search for local sequenc 
alignments. BLAST produces alignments of both nucleotide and amino acid sequences 
to determine sequence similarity. Because of the local nature of the alignments, BLAST 
is especially useful in determining exact matches or in identifying homologs, Th 
fundamental unit of BLAST algorithm output is the High-scoring Segment Pair (HSP). 
An HSP consists of two sequence fragments of arbitrary but equal lengths whose 
alignment is locally maximal and for which the alignment score meets or exceeds a 
threshold or cutoff score set by the user. The BLAST approach is to look for HSPs 
between a query sequence and a database sequence, to evaluate the statistical 
significance of any matches found, and to report only those matches which satisfy the 
user-selected threshold of significance. The parameter E establishes the statistically 
significant threshold for reporting database sequence matches. E is interpreted as the 
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upper bound of the expected frequency of chance occurrence of an HSP (or set of 
HSPs) within the context of the entire database search. Any database sequence whose 
match satisfies E is reported in the program output. 

Analogous computer techniques using BLAST (Altschul. 1997, 1993 and 1990, supra) 
are used to search for identical or related molecules in nucleotide databases such as 
GenBank or EMBL. This analysis is much faster than multiple membrane-based 
hybridizations. In addition, the sensitivity of the computer search can be modified to 
determine whether any particular match is categorized as exact or homologous. The 
basis of the search is the product score which is defined as: 



and it takes into account both the degree of similarity between two sequences and the 
length of the sequence match. For example, with a product score of 40, the match will 
be exact within a 1-2% error, and at 70, the match will be exact. Homologous molecules 
are usually identified by selecting those which show product scores between 15 and 40, 
although lower scores may identify related molecules. 

Identifying derivatives, variants and homologs of the cell cycle interacting 
proteins of the invention 



developed in accordance with the present invention suitable for identifying cell cycle 
interacting proteins. Thus; in ano^^^ present invention relates to a method 

for identifying and obtaining cell cycle interacting proteins comprising a two-hybrid 
screening assay wherein CDC2a or CDC2b as a bait and a cDNA library of cell 
suspension culture as prey are used. Preferably, said CDC2a and CDC2b is CDC2aAt 
and CDC2bAt, respectively. However, CDKs or their corresponding subunits from other 
plants or other organisms such as mammals may be employed as well. The cell culture 
may be from any organism possessing cell cycle interacting proteins such as animals, 
preferably mammals. Particulariy prefen^ed are plant cell suspension cultures such as 
from Arabidopsis. The nucleic acid molecules encoding proteins or peptides identified to 
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interact with CDC2a or CDC2b in the above mentioned assay can be easily obtained 
and - sequenced by methods-known in the art; see- also the appended examples. 
Therefore, the present invention also relates to a DNA sequence encoding a cell cycle 
interacting protein obtainable by the method of the invention. 

In a preferred embodiment the nucleic acid molecules according to the invention are 
RNA or DNA molecules, preferably cDNA. genomic DNA or synthetically synthesized 
DNA or RNA molecules. Since cell cycle interacting proteins are supposed to play a key 
role in plant cell division, corresponding proteins displaying similar properties should be 
present in other organisms including mammals as well. Nucleic acid molecules of the 
invention can be obtained, e.g., by hybridization of the above-described nucleic acid 
molecules with a (sample of) nucleic acid molecule(s) of any source. Nucleic acid 
molecules hybridizing with the above-described nucleic acid molecules can in general 
be derived from any organism, preferably plants possessing such molecules, preferably 
from monocotyledonous or dicotyledonous plants, in particular from plants of interest in 
agriculture, horticulture or wood culture, such as crop plants, namely those of the family 
Poaceae, any starch producing plants, such as potato, maniok, leguminous plants, oil 
producing plants, such as oilseed rape, linenseed, etc., plants using polypeptide as 
storage substances, such as soybean, plants using sucrose as storage substance, such 
as sugar beet or sugar cane, trees, ornamental plants etc. Preferably, the nucleic acid 
molecules according to the invention are derived from crop plants (e.g. maize, rice, 
bariey, wheat, rye, oats etc.), potatoes, oil producing plants (e.g. oilseed rape, 
sunflower, peanut, soybean, etc.), cotton, sugar beet, sugar cane, leguminous plants 
(e.g. beans, peas etc.), and, of course, from Arabidopsis thaliana. Nucleic acid 
molecules hybridizing to the above-described nucleic acid molecules can be isolated, 
e.g., form libraries, such as cDNA or genomic libraries by techniques well known in the 
art. For example, hybridizing nucleic acid molecules can be identified and isolated by 
using the above-described nucleic acid molecules or fragments thereof or complements 
thereof as probes to screen libraries by hybridizing with said molecules according to 
standard techniques. Possible is also the isolation of such nucleic acid molecules by 
applying a nucleic acid amplicification technique such as the polymerase chain reaction 
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(PGR) using as primers oligonucleotides derived form the above-described nucleic acid 
molecules. 

Nucleic acid molecules which hybridize with any of the aforementioned nucleic acid 
molecules also include fragments, derivatives and allelic variants of the above- 
described nucleic acid molecules that encode a cell cycle interacting protein or an 
immunologically active or functional fragment thereof. Fragments are understood to be 
parts of nucleic acid molecules long enough to encode the described protein or a 
functional or immunologically active fragment thereof as defined above, 
The term "derivative" means in this context that the nucleotide sequence of these 
nucleic acid molecules differs from the sequences of the above-described nucleic acid 
molecules in one or more nucleotide positions and are highly homologous to said 
nucleic acid molecules. Homology is understood to refer to a sequence identity of at 
least 40 particulariy-an identity^0fcat««;leastK6G^^^% 80 % and still 

more preferably more* than 90*^ %?* The«sterm -^"substantially^^homologous" refers to a 
subject, for instance^a nucleic acid^^whieh is at ^least\50% identical in sequence to the 
reference when the entire ORF (open reading frame) is eomparedv;where the sequence 
identity is preferably at least 70%, more preferably at least 80%>stilkmore preferably at 
least 85%, especially more than about 90%, most preferably 95% or greater, particulariy 
98% or greater. The deviations from the sequences of the nucleic acid molecules 
described above can, for example, be the result of nucleotide substitution (s), deletion(s), 
addition(s), insertion(s) and/or recombination(s); see supra. 

Homology further means that the respective nucleic acid molecules or encoded proteins 
are functionally and/or structurally equivalent. The nucleic acid molecules that are 
homologous to the nucleic acid molecules described above and that are derivatives of 
said nucleic acid molecuies are, for example, variations of said nucleic acid molecules 
which represent modifications having the same biological function, in particular 
encoding proteins with the same or substantially the same biological function. They may 
be naturally occurring variations, such as sequences from other plant varieties or 
species, or mutations. These nriutations may occur naturally or may be obtained by 



mutagenesis techniques. The allelic variations may be naturally occuning allelic variants 
as well as synthetically produced or genetically engineered variants; see supra. 
The proteins encoded by the various derivatives and variants of the above-described 
nucleic acid molecules may share specific common characteristics, such as biological 
activity, molecular weight, immunological reactivity, conformation, etc., as well as 
physical properties, such as electrophoretlc mobility, chromatographic behavior, 
sedimentation coefficients, pH optimum, temperature optimum, stability, solubility, 
spectroscopic properties, etc. 

Examples of the different possible applications of the nucleic acid molecules according 
to the invention as well as molecules derived from them will be described in detail in the 
following. 

Uses of the nucleic acfd molecules of the present invention 

In one embodiment, the present invention relates to a nucleic acid molecule which 
hybridizes with the complementary strand of the nucleic acid molecule of the invention 
and which encodes a muiated version of the protein as defined above which has lost its 
immunological and/or biological activity. This embodiment may prove useful for, e.g., 
generating dominant mutant alleles of the above-described cell cycle interacting 
proteins. Said mutated version is preferably generated by substitution, deletion and/or 
addition of 1 to 5 or 5 to 10 amino acid residues in the amino acid sequence of th 
above-described wild type proteins. 

In a further embodiment, the invention relates to nucleic acid molecules of at least 15 
nucleotides in length hybridizing specifically with a nucleic acid molecule as described 
above or with a complementary strand thereof. Specific hybridization occurs preferably 
under stringent conditions and implies no or very little cross-hybridization with 
nucleotide sequences encoding no or substantially different proteins. Such nucleic acid 
molecules may be used as probes and/or for the control of gene expression. Nucleic 
acid probe technology is well known to those skilled in the art who will readily appreclat 
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that such probes may vary in length. Preferred are nucleic acid probes of 16 to 35 
nucleotides in length. Of course, it may also be appropriate to use nucleic acids of up to 
100 and more nucleotides in length! The nucleic acid probes of the invention are useful 
for various applications. On the one hand, they may be used as primers for amplification 
of nucleic acid sequences according to the invention. The design and use of said 
primers is known by4he person skilled in the art. Preferably such amplification primers 
comprise a contiguous sequence of at least 6 nucleotides, in particular 13 nucleotides, 
preferably 15 to 25 nucleotides or more, identical or complementary to the nucleotide 
sequence depicted in SEQ ID NOS: 1. 3, 5. 7. 9, 11 or 13 or to a nucleotide sequence 
encoding the amino acid sequence of SEQ ID NOS: 2, 4, 6. 8, 10, 12 or 14. Another 
application is the use as a hybridization probe to identify nucleic acid molecules 
hybridizing with a nucleic acid molecule of the Invention by homology screening of 
genomic DNA or cDNA libraries. Nucleic acid molecules according to this preferred 
embodiment of the* invention which are complementary to- a nucleic acid molecule as 
described above may also be used for repression of expression of a cell cycle gene, for 
example due to an antisense or triple helix effect or for the constnjction of appropriate 
ribozymes (see. e.g.. EP-A1 0 291 533. EP-A1 0 321 201. EP-A2 0 360 257) which 
specifically cleave the (pre)-mRNA of a gene comprising a nucleic acid molecule of the 
invention or part thereof. Selection of appropriate target sites and corresponding 
ribozymes can be done as described, for example, in Steinecke, Ribozymes. Methods in 
Cell Biology 50, Galbraith et al. eds Academic Press. Inc. (1 995), 449-460. The above 
described nucleic acid molecules may either be DNA or RNA or a hybrid thereof. 
Furthermore, said nucleic acid hiolecule may contain, for example, thioester bonds 
and/or nucleotide analogues, commonly used in oligonucleotide anti-sense approaches. 
Said modifications may be useful for the stabilization of the nucleic acid molecule 
against endo- and/or exonucleases in the cell. Said nucleic acid molecules may be 
transcribed by an appropriate vector containing a chimeric gene which allows for the 
transcription of said nucleic acid molecule in the cell. 

Furthermore, the person skilled in the art is well aware that it is also possible to label 
such a nucleic acid probe with an appropriate mari<er for specific applications, such as 
for the detection of the presence of a nucleic acid molecule of the invention in a sample 




derived from an organism, in particular plants. A number of companies such as 
Pharmacia Biotech (Pisa^taway NJ), Promega (Madison Wl), and US Biochemical Corp 
(Cleveland OH) supply commercial kits and protocols for these procedures. SuitabI 
reporter molecules or labels include those radionuclides, enzymes, fluorescent, 
chemiluminescent, or chromogenic agents as well as substrates, cofactors, inhibitors, 
magnetic particles and the like. Patents teaching the use of such labels include US 
Patents US-A-3,81 7,837; US.A-3.850,752; US-A-3,939,350; US-A-3, 996,345; US-A- 
4,227,437; US-A-4.275,149 and 4,366,241. Also, recombinant immunoglobulins may be 
produced as shown in US-A-4,81 6,567 incorporated herein by reference. 
Furthermore, the so-called "peptide nucleic acid" (PNA) technique can be used for the 
detection or inhibition of the expression of a nucleic acid molecule of the invention. For 
example, the binding of PNAs to complementary as well as various single stranded RNA 
and DNA nucleic acid molecules can be systematically investigated using thermal 
denaturation and BIAcore surface-interaction techniques (Jensen, Biochemistry 36 
(1997), 5072-5077). Furthermore, the nucleic acid molecules described above as well 
as PNAs derived therefrom can be used for detecting point mutations by hybridization 
with nucleic acids obtained from a sample with an affinity sensor, such as BIAcore; see 
Gotoh, Rinsho Byori 45 (1997), 224-228. Hybridization based DNA screening on peptide 
nucleic acids (PNA) oligomer arrays are described in the prior art, for example in Weiler, 
Nucleic Acids Research 25 (1997), 2792-2799. The synthesis of PNAs can be 
perfomned according to methods known in the art, for example, as described in Koch, J. 
Pept. Res. 49 (1997), 80-88; Finn, Nucleic Acids Research 24 (1996), 3357-3363. 
Further possible applications of such PNAs, for example as restriction enzymes or as 
templates for the synthesis of nucleic acid oligonucleotides are known to the person 
skilled in the art and are, for example, described in Veselkov, Nature 379 (1996), 214 
and Bohler, Nature 376 (1 995), 578-581 . 

Detection and mapping of related polynucleotide sequences 

The nucleic acid sequence for a cell cycle interacting protein of the invention can also 
be used to generate hybridization probes for mapping the naturally occurring genomic 
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sequence. The sequence may be mapped to a particular chromosome or to a specific 
region of the chromosome using well known techniques. These include in situ 
hybridization to chromosomal spreads, flow-sorted chromosonflal preparations, or 
artificial chromosome constructions such as yeast artificial chromosomes, bacterial 
artificial chromosomes, bacterial P1 constructions or single chromosome cDNA libraries 
as reviewed in Rrice (BloQdr.Rev. 7 (1993), 127^134) and Trask*(Trends Genet. 7 (1991), 
149-154). The technique of fluorescent in situ hybridization of chromosome spreads has 
been described, among other places, in Verma, (1988) Human Chromosomes: A 
Manual of Basic Techniques, Pergamon Press, New York NY, Fluorescent in situ 
hybridization of chromosomal preparations and other physical chromosome mapping 
techniques may be correlated with additional genetic map data. Examples of genetic 
map data can be found in the 1994 Genome Issue of Science (265:1 981 f) and Meinke, 
Science 282 (1998), 662-682. Correlation between the location of the gene encoding a 
cell cycle interacting protein of the invention on a physical chromosomal map and a 
specific feature, e.g., plant growth, architecture, yield, disease etc. may help delimit the 
region of DNA associated with this feature. The nucleotide sequences of the subject 
invention may be used to detect differences in gene sequences between normal, carrier 
or affected individuals. Furthermore, the means and methods described herein can be 
used for marker-assisted breeding. 

In sftu hybridization of chrom.osomal preparations and physical mapping techniques 
such as linkage analysis using established chromosomal markers may be used for 
extending genetic maps. For example an sequence taggied site based map of the 
human genome was recently published by the Whitehead-MIT Center for Genomic 
Research (Hudson, Science 270 (1995), 1945-1954) on a map of the plant genome by 
way of the Arabidopsis genome is available from http://genome.wwz.Stanford.edu/cgi- 
bin/AtDB/nph-blast2atdb. Often the placement of a gene on the chromosome of another 
species may reveal associated marker even if the number or arm of a particular 
chromosome is not known. New sequences can be assigned to chromosomal arms, or 
parts thereof, by physical mapping. This provides valuable information to investigators 
searching for interacting genes using positional cloning or other gene discovery 



techniques. Once such gene has been caidely localized by genetic linkage to a 
particular genomic region, any sequences mapping to that area may represent 
associated or regulatory genes for further investigation. The nucleotide sequence of th 
subject invention may also be used to detect differences in the chromosomal location 
due to translocation, inversion, etc. among normal, cam'er or affected individuals. 

Vectors and expression systems 

The present invention also relates to vectors, particularly plasmids, cosmids, viruses, 
bacteriophages and other vectors used conventionally in genetic engineering that 
contain a nucleic acid molecule according to the invention. Methods which are well 
known to those skilled in the art can be used to construct various plasmids and vectors; 
see, for example, the techniques described in Sambrook, Molecular Cloning A 
Laboratory Manual, Cold Spring Harbor Laboratory (1989) N.Y. and Ausubel, Current 
Protocols in Molecular Biology, Green Publishing Associates and Wiley Interscience, 
N.Y. (1989), (1994). Plasmids and vectors to be preferably employed in accordance with 
the present invention include those well known in the art. Alternatively, the nucleic acid 
molecules and vectors of the Invention can be reconstituted into liposomes for delivery 
to target cells. 

In a preferred embodiment the nucleic acid molecule present in the vector is linked to 
(a) control sequence(s) which allow the expression of the nucleic acid molecule in 
prokaryotic and/or eukaryotic cells. 

The term "control sequence" refers to regulatory DNA sequences which are necessary 
to effect the expression of coding sequences to which they are ligated. The nature of 
such control sequences differs depending upon the host organism. In prokaryotes, 
control sequences generally include promoter, ribosomal binding site, and terminators. 
In eukaryotes generally control sequences include promoters, terminators and. In som 
instances, enhancers, transactivators or transcription factors. The term "control 
sequence" is intended to include, at a minimum, all components the presence of which 
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are necessary for expression, and may also include additional advantageous 
components. 

The term "operably^'linkey" refers to a juxtaposition wherein the components so 
described are in a relationship permitting them to function in their intended manner. A 
control sequence "operably linked" to a coding sequence is ligated in such a way that 
expression of the coding sequence is achieved under conditions compatible with the 
control sequences. In case the control sequence is a promoter, it is obvious for a skilled 
person that double-stranded nucleic acid is preferably used. 

Thus, the vector of the invention is preferably an expression vector. An "expression 
vector" is a construct that can be used to transform a selected host cell and provides for 
expression of a coding sequence in the selected host. Expression vectors can for 
instance be cloning vectors, binary vectors or integrating vectors. Expression compris s 
transcription of the nucleic acid molecule preferably into a translatable mRNA. 
Regulatory elements ensuring expression in prokaryotic and/or eukaryotic cells are well 
known to those skilled in the art. In the case of eukaryotic cells they comprise normally 
promoters ensuring initiation of transcription and optionally poly-A signals ensuring 
termination of transcription and stabilization of the transcript, for example in plants, 
those of the 35S RNA from Cauliflower Mosaic Virus (CaMV). Other promoters 
commonly used are the polyubiquitin promoter, and the actin promoter for ubiquitous 
expression. The termination signals usually employed are from the Nopaline Synthase 
promoter or from the CAMV 35S promoter A plant translational enhancer often used is 
the TMV omega sequences, the inclusion of an intron (lntron-1 from the Shrunken gene 
of maize, for example) has been shown to increase expression levels by up to lOOrfold. 
(Mait, Transgenic Research 6 (1997), 143-156; Ni, Plant Joumal 7 (1995), 661-676), 
Additional regulatory elements may include transcriptional as well as translational 
enhancers. Possible regulatory elements permitting expression in prokaryotic host cells 
comprise, e.g., the Pl. /ac, trp or tac promoter in E. coli, and examples of regulatory 
elements permitting expression in eukaryotic host cells are the AOX1 or GAL1 promoter 
in yeast or the CMV-, SV40-, RSV-promoter (Rous sarcoma virus), CMV-enhancer, 
SV40-enhancer or a globin intron in mammalian and other animal cells. In this context, 
suitable expression vectors are known in the art such as Okayama-Berg cDNA 



expression vector pcDV1 (Pharmacia), pCDM8, pRc/CMV, pcDNAI, pcDNA3 (In- 
vitrogene), pSPORTI (GIBCO BRL). An alternative expression system which could be 
used to express a cell cycle interacting protein is an insect system. In one such system, 
Autographs califomica nuclear polyhedrosis vims (AcNPV) is used as a vector to 
express foreign genes in Spodoptera frugiperda cells or in Trichoplusia larvae. Th 
coding sequence of a nucleic acid molecule of the invention may be cloned into a 
nonessential region of the virus, such as the polyhedrin gene, and placed under control 
of the polyhedrin promoter. Successful insertion of said coding sequence will render the 
polyhedrin gene inactive and produce recombinant virus lacking coat protein coat. The 
recombinant viruses are then used to infect S. frugiperda cells or Trichoplusia larvae in 
which the protein of the invention is expressed (Smith, J. Virol. 46 (1983), 584; 
Engelhard. Proc. Nat. Acad. Sci. USA 91 (1994). 3224-3227). 

Advantageously, the above-described vectors of the invention comprises a selectable 
and/or scorable marker. Selectable marker genes useful for the selection of transformed 
plant cells, callus, plant tissue and plants are well known to those skilled in the art and 
comprise, for example, antimetabolite resistance as the basis of selection for dhfr, which 
confers resistance to methotrexate (Reiss, Plant Physiol. (Life Sci. Adv.) 13 (1994), 143- 
149); npt, which confers resistance to the aminoglycosides neomycin, kanamycin and 
paromycin (Hen^era-Estrella. EMBO J. 2 (1983), 987-995) and hygro, which confers 
resistance to hygromycin (Marsh, Gene 32 (1984), 481-485). Additional selectable 
genes have been described, namely trpB, which allows cells to utilize indole in place of 
tryptophan; hisD. which allows cells to utilize histinol in place of histidine (Hartman, 
Proc. Natl. Acad. Sci. USA 85 (1988), 8047); mannose-6-phosphate isomerase which 
allows cells to utilize mannose (WO 94/20627) and ODC (ornithine decarboxylase) 
which confers resistance to the ornithine decarboxylase inhibitor, 2-(difluoromethyl)-DL- 
omithine. DFMO (McConlogue, 1987, In: Current Communications in Molecular Biology, 
Cold Spring Harbor Laboratory ed.) or deaminase from Aspergillus terreus which 
confers resistance to Blasticidin S (Tamura, Biosci. BiotechnoL Biochem. 59 (1995), 
2336-2338). 

Useful scorable marker are also known to those skilled in the art and are commercially 
available. Advantageously, said marker is a gene encoding luciferase (Giacomin, PI. 



26 

ScL 116 (1996), 59-72; Scikantha, J. Bact 178 (1996), 121). green fluorescent protein 
(Gerdes, FEBS Lett. 389 (1996), 44-47) or B-glucuronidase (Jefferson, EMBO J. 6 
(1987), 3901-3907). This embodiment is particularfy useful for simple and rapid 
screening of cells, tissues and organisms containing a vector of the invention. 

The present invention furthermore relates to host cells comprising a vector as described 
above or a nucleic acid molecule according to the invention wherein the nucleic acid 
molecule is foreign to the host cell. 

By "*foreign" it is meant that the nucleic acid molecule is either heterologous with respect 
to the host cell, this means derived from a cell or organism with a different genomic 
background, or is homologous with respect to the host cell but located in a different 
genomic environment than the naturally occum*ng counterpart of said nucleic acid 
molecule. This means that, if the nucleic acid molecule is homologous with respect to 
the host cell, it is not located in its natural location in the genome of said host cell, in 
particular it is surrounded^by different genes^ In this case the nucleic acid molecule may 
be either under the-^eontrol of its own promoter or under the controLof a heterologous 
promoter. The vector *or nucleic acid ^molecule -according to the invention which is 
present in the host cell may either-.be integrated into the genome^of the host cell or it 
may be maintained in some form extrachromosomaily. In this respect, it Is also to be 
understood that the nucleic acid molecule of the invention can be used to restore or 

Recombination and Gene Silencing in Plants. Kluwer Academic Publishers (1994)). 
The host cell can such as bacterial, insect, fungal, 

plant or animal cells. Preferred fungal cells are, for example, those of the genus 
Saccharomyces, in particular those of the species S. cerevisiae. 

The term "prokaryotic" is meant to include all bacteria which can be transformed or 
transfected with a DNA or RNA molecules for the expression of a protein of the 
invention. Prokaryotic hosts may include gram negative as well as gram positive 
bacteria such as. for example, E. coli, S. typhimurium, Serratia marcescens and Bacillus 
subtilis. The term "eukaryotic" is meant to include yeast, higher plant, insect and 
preferably mammalian cells. Depending upon the host employed in a recombinant 
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production procedure, the protein encoded by the polynucleotide of the present 
invention may be glycosylated or may be non-glycosylated. The cell cycle interacting 
proteins of the invention may or may not also include an initial methionine amino acid 
residue. A polynucleotide of the invention can be used to transform or transfect the host 
using any of the techniques commonly known to those of ordinary skill in the art. 
Furthermore, methods for preparing fused, operably linked genes and expressing them 
in, e.g., mammalian cells and bacteria are well-known In the art (Sambrook, Molecular 
Cloning: A Laboratory Manual, Cold Spring Haribor Laboratory, Cold Spring Harbor, NY, 
1989). 

Another subject of the invention is a method for the preparation of cell cycle interacting 
proteins which comprises the cultivation of host cells according to the invention which, 
due to the presence of a vector or a nucleic acid molecule according to the invention, 
are able to express such a protein, under conditions which allow expression of the 
protein and recovering of the so-produced protein from the culture. It is also to be 
understood that the proteins can be expressed in a cell free system using for example in 
vitro translation assays known in the art. 

The term "expression" means the production of a protein or nucleotide sequence in the 
cell. However, said term also includes expression of the protein in a cell-free system. It 
includes transcription into an RNA product, post-transcriptional modification and/or 
translation to a protein product or polypeptide from a DNA encoding that product, as 
well as possible post-translational modifications. Depending on the specific constructs 
and conditions used, the protein may be recovered from the cells, from the culture 
medium or from both. The terms "protein" and "polypeptide" used in this application are 
interchangeable. "Polypeptide" refers to a polymer of amino acids (amino acid 
sequence) and does not refer to a specific length of the molecule. Thus peptides and 
oligopeptides are included within the definition of polypeptide. This term does also refer 
to or include post-translational modifications of the polypeptide, for example, 
glycosylations, acetylations, phosphorylations and the like. Included within the definition 
are, for example, polypeptides containing one or more analogs of an amino acid 
(including, for example, unnatural amino acids, etc.), polypeptides with substituted 
linkages, as well as other modifications known in the art, both naturally occurring and 
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non-naturally occurring. For example, it is well known by the person skilled in the art that 
it is not only possible to express a native protein but also to express the protein as 
fusion polypeptides or to add signal sequences directing the protein to specific 
compartments of the host cell, e.g., ensuring secretion of the protein into the culture 
medium, etc. The protein of the invention may also be expressed as a recombinant 
protein with one or more additional polypeptide domains added to facilitate protein 
purification. Such purification facilitating domains include, but are not limited to, metal 
chelating peptides such as histidine-tryptophan modules that allow purification on 
immobilized metals, protein A domains that allow purification on immobilized 
immunoglobulin, and the domain utilized in the FLAGS extension/affinity purification 
system (Immunex Corp, Seattle WA). The inclusion of a cleavable linker sequences 
such as Factor XA or enterokinase (Invitrogen, San Diego CA) between the purification 
domain and the protein of interest is useful to facilitate purification. One such expression 
vector provides for expression of ia fusion protein compromising a cell cycle interacting 
protein and contains mucleic acidssencoding 6 histidine residues followed by thioredox'm 
and an enterokinase cleavage site*4jhe histidine-residues4acilitate-purification on IMIAC 
(immobilized metal ion affinity chromatography^ as described in Porath, Protein 
Expression and Purification 3 (1992), 263-281) while 4he enterokinase cleavage site 
provides a means for purifying the cell cycle interacting protein from the fusion protein. 
In addition to recombinant production, fragments of the protein of the invention may be 
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(1969) Solid Phase Peptide Synthesis, WH Freeman Co, San Francisco; Merrifield, J. 
Am: Cher^^^^^^^^ 2149-2154); //I \//fro protein synthesis may be performed 

using manual techniques or by automation. Automated synthesis may be achieved, for 
example, using Applied Biosystems 431 A Peptide Synthesizer (Perkin Elmer, Foster 
City CA) in accordance with the instructions provided by the manufacturer. Various 
fragments of the cell cycle interacting protein of the invention may be chemically 
synthesized and/or modified separately and combined using chemical methods to 
produce the full length molecule. Once expressed or synthesized, the protein of the 
present invention can be purified according to standard procedures of the art, including 
ammonium sulfate precipitation, affinity columns, column chromatography, gel 



electrophoresis and the like; see, Scopes, "Protein Purification", Springer-Verlag, N.Y. 
(1982). Substantially pure proteins of at least about 90 to 95% homogeneity are 
preferred, and 98 to 99% or more homogeneity are most preferred, for pharmaceutical 
uses. Once purified, partially or to homogeneity as desired, the proteins may then be 
used therapeutically (including extracorporeally) or in developing and performing assay 
procedures. 

Cell cycle interacting proteins of the invention 

The present invention furthermore relates to cell cycle interacting proteins encoded by 
the nucleic acid molecules according to the invention or produced or obtained by the 
above-described methods, and to functional and/or immunologically active fragments of 
such cell cycle interacting proteins. The proteins and polypeptides of the present 
invention are not necessarily translated from a designated nucleic acid sequence; the 
polypeptides may be generated in any manner, including for example, chemical 
synthesis, or expression of a recombinant expression system, or isolation from a 
suitable viral system. The polypeptides may include one or more analogs of amino 
acids, phosphorylated amino acids or unnatural amino acids. Methods of inserting 
analogs of amino acids into a sequence are known in the art. The polypeptides may 
also include one or more labels, which are known to those skilled in the art. In this 
context, it is also understood that the proteins according to the invention may be further 
modified by conventional methods known in the art. By providing the proteins according 
to the present invention it is also possible to determine fragments which retain biological 
activity. This allows the construction of chimeric proteins and peptides comprising an 
amino sequence derived from the protein of the invention, which is crucial for its, e.g., 
binding activity and other functional amino acid sequences, e.g. GUS marker gene 
(Jefferson, EMBO J. 6 (1987), 3901-3907), The other functional amino acid sequences 
may be either physically linked by, e.g., chemical means to the proteins of the invention 
or may be fused by recombinant DNA techniques well known in the art. 
The term "fragment of a sequence" or "part of a sequence" means a truncated 
sequence of the original sequence referred to. The truncated sequence (nucleic acid or 
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protein sequence) can vary widely in length; the minimum size being a sequence of 
sufficient size to provide a sequence with at least a comparable function and/or activity 
of the original sequence referred to, while the maximOm size Ts not criticaK In some 
applications, the maximum size usually is not substantially greater than that required to 
provide the desired activity and/or function(s) of the original sequence. Typically, the 
truncated amino acid sequence will range from about 5 to about 60 amino acids in 
length. More typically, however, the sequence will be a maximum of about 50 amino 
acids in length, preferably a maximum of about 30 amino acids. It is usually desirable to 
select sequences of at least about 10, 12 or 15 amino acids, up to a maximum of about 
20 or 25 amino acids. 

Furthermore, folding simulations and computer redesign of structural motifs of the protein 
of the invention can be performed using appropriate computer programs (Olszewski, 
Proteins 25 (1996), 286-299; Hoffman, Comput. Appl. Biosci. 11 (1995), 675-679). 
Computer modeling of protein folding can be used for the conformational and energetic 
analysis of detailed peptide and protein models (Monge, J. Mol. BioL 247 (1995), 995- 
1012; Renouf. Adv. Exp. Med. Biol. 376 (1995), 37-45). In particular, the appropriate 
programs can be used for the identification of interactive sites of the cell cycle interacting 
protein and its receptor, its ligand or other interacting proteins by computer assistant 
searches for complementary peptide sequences (Fassina, Immunomethods 5 (1994), 
1 14-120. Further appropriate computer systems for the design of protein and peptides are 
described in the prior art, for example in Berry, Biochem. Soc. Trans. 22 (1994), 1033- 
1036; Wodak, Ann. N. Y. Acad. Sci. 501 (1987), 1-13; Pabo, Biochemistry 25 (1986). 
5987-5991. The results obtained from the above<Jescribed computer analysis can be 
used for, e.g., the preparation of peptide mimetics of the protein of the invention or 
fragments thereof. Such pseudopeptide analogues of the natural amino acid sequence of 
the protein may very efficiently mimic the parent protein (Benkirane, J. Biol. Chem. 271 
(1996), 33218-33224). For example, incorporation of easily available achiral (-amino acid 
residues into a protein of the invention or a fragment thereof results in the substitution of 
amide bonds by polymethylene units of an aliphatic chain, thereby providing a convenient 
strategy for constmcting a peptid mimetic (Baneqee, Biopolymers 39 (1996), 769-777). 
Superactive peptidomimetic analogues of small peptide hormones in other systems are 




described in the prior ar* (Zhang, Biochem. Biophys. Res. Commun. 224 (1996), 327- 
331 ). Appropriate peptide mlmetics of the protein of the present invention can also be 
identified by the synthesis of peptide mimetic combinatorial libraries through successive 
amide alkylation and testing the resulting compounds, e.g., for their binding and 
immunological properties. Methods for the generation and use of peptidomimetic 
combinatorial libraries are described in the prior art, for example in Ostresh, Methods in 
Enzymology 267 (1996), 220-234 and Domer. Bioorg. Med. Chem. 4 (1996), 709-715. 
Furthermore, a three-dimensional and/or crystallographic stmcture of the protein of the 
Invention can be used for the design of peptide mimetic inhibitors of the biological activity 
of the protein of the invention (Rose, Biochemistry 35 (1996). 12933-12944; Rutenber. 
Bioorg. Med. Chem. 4 (1996). 1545-1558). 

Antibodies 

Furthermore, the present invention relates to antibodies specifically recognizing a cell 
cycle interacting protein according to the invention or parts, i.e. specific fragments or 
epitopes, of such a protein. The antibodies of the invention can be used to identify and 
isolate other cell cycle interacting proteins and genes in any organism, preferably plants. 
These antibodies can be monoclonal antibodies, polyclonal antibodies or synthetic 
antibodies as well as fragments of antibodies, such as Fab, Fv or scFv fragments etc. 
Monoclonal antibodies can be prepared, for example, by the techniques as originally 
described in Kohler and Milstein, Nature 256 (1975), 495, and Galfre, Meth. Enzymol. 73 
(1981), 3, which comprise the fusion of mouse myeloma cells to spleen cells derived from 
immunized mammals. Furthermore, antibodies or fragments thereof to the 
aforementioned peptides can be obtained by using methods which are described, e.g., 
in Harlow and Lane "Antibodies, A Laboratory Manual", CSH Press, Cold Spring Harbor. 
1988. These antibodies can be used, for example, for the immunoprecipitation and 
immunolocalization of proteins according to the invention as well as for the monitoring of 
the synthesis of such proteins, for example, in recombinant organisms, and for the 
identification of compounds interacting with the protein according to the invention. For 
example, surface plasmon resonance as employed in the BIAcore system can be used to 
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increase the efficiency of phage antibodies selections, yielding a high increment of affinity 
from a single library of phage antibodies which bind to an epitope of the protein of the 
invention (Schier, Human- Antibodies Hybridorri^s 7 (1 996), 97-1 05; Malmborg, J, 
Immunol. Methods 183 (1995), 7-13). In many cases, the binding phenomena of 
antibodies to antigens is equivalent to other ligand/anti-ligand binding. 

Transgenic plants 

Plant cell division can conceptually be influenced in three ways : (i) inhibiting or 
arresting cell division, (ii) maintaining, facilitating or stimulating cell division, (iii) 
uncoupling DNA synthesis from mitosis and cytokinesis or (iv) uncoupling cell division 
from intrinsic developmental or extemal environmental conditions. Modulation of the 
expression of a cell cycle interacting protein encoded by a nucleotide sequence 
according to the invention- has surprisingly an advantageous influence on plant cell 
division characteristics, in particular on the disruption of the expression levels of genes 
involved in G1/S and/or G2/M transition-and as a result thereof on the total make-up of 
the plant concemed or parts thereof. An example is that DNA synthesis, progression of 
DNA replication or mitoses wilh be negatively influenced by interfering with the formation 
of a cyclin-dependent protein kinase complex. Alternatively, overexpression of the cell 
cycle interacting protein accelerates reentry into the cell cycle. 

The term "cyclin-dependent protein kinase complex" means the complex form.ed when 
a, preferably functional, cyclin associates with a, preferably, functional cyclin dependent 
kinase. Such complexes may be active in phosphorylating proteins and may or may not 
contain additional protein species. 

The term "protein kinase" means an enzyme catalyzing the phosphorylation of proteins. 

To analyse the industrial applicabilities of the invention, transformed plants can be made 
overproducing the nucleotide sequence according to the invention. Such an 
overexpression of the new gene(s), proteins or inactivated variants thereof will either 
positively or negatively have an effect on cell division. Methods to modily the expression 
levels and/or the activity are known to persons skilled in the art and include for instance 



overexpression, co-suppression, the use of ribozymes, sense and anti-sense strategies, 
gene silencing approaches. "Sense strand" refers to the strand of a double-stranded 
DNA molecule that is homologous to a mRNA transcript thereof. The "anti-sense strand" 
contains an inverted sequence which is complementary to that of the "sense strand". 

Hence, the nucleic acid molecules according to the invention are in particular useful for 
the genetic manipulation of plant cells in order to modify the characteristics of plants and 
to obtain plants with modified, preferably with improved or useful phenotypes. Similarty, 
the invention can also be used to modulate the cell division and the growth of cells, 
preferentially plant cells, in in vitro cultures. Specifically the plant cell division rate and/or 
the inhibition of a plant cell division can be influenced by overexpression or reducing the 
expression of a gene encoding a protein according to the invention. Overexpression of a 
cell cycle interacting protein encoding gene according to the invention promotes cell 
proliferation, while reducing gene expression arrests cell division or prevents reentry 
into the cell cycle. Part of the invention is thus the usage of the nucleic acid molecules 
as mentioned hereinbefore as a negative or positive regulator of cell proliferation. As a 
result of overproduction the G1/S generation time is shortened whereas the proliferation 
is less dependent on growth factors. A transformed plant can thus be obtained by 
transforming a plant cell with a gene encoding a polypeptide concerned or fragment 
thereof alone or in combination. For this purpose tissue specific promoters, in one 
construct or being present as a separate construct in addition to the sequence 
concerned, can be used. Altematively the expression of the cell cycle interacting protein 
is inducible by cytokinines or sucrose. Surprisingly using a polypeptide or fragment 
thereof according to the invention or using antisense RNA for the gene according to th 
invention cell division of the meristems of the plant can be manipulated, positively 
and/or negatively respectively. Furthermore, overproduction of the cell cycle interacting 
protein of the invention enhances growth and results in cell division to be less sensitive 
to an arrest caused by environmental stress such as salt, drought, chilling and the like. 
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Thus, the present invention relates to a method for the production of transgenic plants, 
plant cells or plant tissue comprising the introduction of a nucleic acid molecule or 
vector of the invention intdlh^^gehoritS'of said ]SIant, plant dell or'plant tissue. 

For the expression of the nucleic acid molecules according to the invention in sense or 
antisense orientation in plant cells, the molecules are placed under the control of 
regulatory elements which ensure the expression In plant cells. These regulatory elements 
may be heterologous or homologous with respect to the nucleic acid molecule to be 
expressed as well with respect to the plant species to be transformed. In general, such 
regulatory elements comprise a promoter active in plant cells. To obtain expression In all 
tissues of a transgenic pfant, preferably constitutive promoters are used, such as the 35 S 
promoter of CaMV (Odell. Nature 313 (1985), 810-812) or promoters of the polyublquitin 
genes of maize (Christensen, Plant Mol. Biol. 18 (1982), 675-689). In order to achiev 
expression in specific tissues of a transgenic plant it is possible to use tissue specific 
promoters (see, e.g., SJockhaus, EMBO J. 8 (1989), 2245-225 1). Known are also 
promoters which are specrfically active in tubers of potatoes or in seeds of different plants 
species, such as maize, Vicia, wheat, bariey etc. Inducible promoters may be used In 
order to be able to exactly control expression. An example for inducible promoters are the 
promoters of genes encoding heat shock proteins. Also microspore-specific regulatory 
elements and their uses have been described (W096/16182)- Furthermore, the 
chemicallv inducible Tet-svstem mav be emnloved /Gate. Mol. Gen. Genet^ 
229-237). Further suitable promoters are known to the person skilled in the art and are 
described, e.g., in Ward {Plant Mol. Biol. 22 (1993), 361-366). The regulatory elements 
may further comprise transcriptional and/or transfational enhancers functional in plants 
cells. Furthemiore, the regulatory elements may include transcription termination signals, 
such as a poly-A signal, which lead to the addition of a poly A tail to the transcript which 
may improve its stability. 

In the case that a nucleic acid molecule according to the invention is expressed in sense 
orientation it is in principle possible to modify the coding sequence in such a way that the 
protein is located in any desired compartment of the plant cell. These include the nucleus, 



endoplasmatic reticulum, the vacuole, the mitochondria, the plastids, the apoplast, the 
cytoplasm etc. Since the interacting component of the protein of the invention excerts its 
effects in the cytoplasm and/or nucleus, corresponding signal sequences are preferred to 
direct the protein of the invention in the same compartment. Methods how to canry out this 
modifications and signal sequences ensuring localization in a desired compartment are 
well known to the person skilled in the art. 

Methods for the introduction of foreign DNA into plants are also well known in the art. 
These include, for example, the transfomnation of plant cells or tissues with T-DNA using 
Agrobacterium tumefaciens or Agrobacterium rhizogenes, the fusion of protoplasts, direct 
gene transfer (see, e.g., EP-A 164 575), injection, electroporation, biolistic methods like 
particle bombardment, pollen-mediated transfomnation, plant RNA virus-mediated 
transformation, liposome-mediated transformation, transformation using wounded or 
enzyme-degraded immature embryos, or wounded or enzyme-degraded embryogenic 
callus and other methods known in the art. The vectors used in the method of the 
invention may contain further functional elements, for example "left bordei'- and "right 
border"-sequences of the T-DNA of Agrobacterium which allow for stably integration into 
the plant genome. Furthermore, methods and vectors are known to the person skilled in 
the art which permit the generation of marker free transgenic plants, i.e. the selectable 
or scorable marker gene is lost at a certain stage of plant development or plant 
breeding. This can be achieved by, for example cotransformation (Lyznik, Plant Mol. 
Biol. 13 (1989), 151-161; Peng, Plant Mol. Biol. 27 (1995). 91-104) and/or by using 
systems which utilize enzymes capable of promoting homologous recombination in 
plants (see. e.g., WO97/08331; Bayley, Plant Mol. Biol. 18 (1992), 353-361); Lloyd, Mol. 
Gen. Genet. 242 (1994), 653-657; Maeser, Mol. Gen. Genet. 230 (1991). 170-176; 
Onouchi, Nucl. Acids Res. 19 (1991), 6373-6378). Methods for the preparation of 
appropriate vectors are described by, e.g., Sambrook (Molecular Cloning; A Laboratory 
Manual, 2nd Edition (1989), Cold Spring HartDor Laboratory Press, Cold Spring Hari3or, 
NY), 

Suitable strains of Agrobacterium tumefaciens and vectors as well as transformation of 
Agrobacteria and appropriate growth and selection media are well known to those 
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skilled in the art and are described in the prior art (GV3101 (pMK90RK), Koncz, MoL 
Gen. Genet 204 (1986), 383-396; C58C1 (pGV 3850kan). Deblaere, NucL Acid Res. 13 
(1985), 4777; Bevan. Nucleic. Acid Res. 12(1984), 8711; Koncz, Proc. Natl. Acad. Sd. 
USA 86 (1989). 8467-8471; Koncz, Plant Mol. Biol. 20 (1992). 963-976; Koncz, 
Specialized vectors for gene tagging and expression studies. In: Plant Molecular Biology 
Manual Vol 2, Gelvin and Schilperoort (Eds.), Dordrecht, The Netherlands: Kluwer 
Academic Publ. (1994). 1-22; EP-A-120 516; Hoekema: The Binary Plant Vector 
System, Offsetdrukkerij Kanters B.V.. Aiblasserdam (1985), Chapter V, Fraley, Crit, 
Rev. Plant Sci„ 4. 1-46; An, EMBO J. 4 (1985), 277-287). Although the use of 
Agrobacterium tumefadens is preferred in the method of the invention, other 
Agrobacterium strains, such as Agrobacterium rhizogenes, may be used, for example if a 
phenotype conferred by said strain is desired. 

Methods for the transfomiation using biolistic methods are well known to the person 
skilled in the art; see. e.g.. Wan, Plant PhysioL 104 (1994), 37^8; Vasil, Bio/Technology 
11 (1993), 1553-1558 and Christou (1996) Trends in Plant Science 1, 423-431. 
Microinjection can be performed as described in Potrykus and Spangenberg (eds.). Gene 
Transfer To Plants. Springer Veriag, Beriin, NY (1995). 

The transfomiation of most dicotyledonous plants is possible with the methods described 
above. But also for the transformation of monocotyledonous plants several successful 
transformation techniques have been developed. These include the transformation using 
biolistic metiiods as, e.g., described above as well as protoplast transfonmation, 
electroporation of partially permeabilized cells, introduction of DNA using glass fibers, etc. 
The term "transformation" as used herein, refers to the transfer of an exogenous 
polynucleotide into a hcst cell, irrespective of the method used for the transfer. The 
polynucleotide may be transientiy or stably introduced into the host cell and may be 
maintained non-integrated, for example, as a plasmid or as chimeric links, or 
alternatively, may be integrated into the host genome. The resulting transformed plant 
cell can then be used to regenerate a transformed plant in a manner known by a skilled 
person. 
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In general, the plants which can be modified according to the invention and which either 
show overexpression of a protein according to the invention or a reduction of the 
synthesis of such a protein can be derived from any desired plant species. They can be 
monocotyledonous plants or dicotyledonous plants, preferably they belong to plant 
species of interest in agriculture, wood culture or horticulture interest, such as crop plants 
(e.g. maize, rice, bariey, wheat, rye, oats etc.), potatoes, oil producing plants (e.g. oilseed 
rape, sunflower, pea nut soy bean, etc.), cotton, sugar beet, sugar cane, leguminous 
plants (e.g. beans, peas etc.), wood producing plants, preferably trees, etc. 

Thus, the present invention relates also to a transgenic plant cell which contains 
(preferably stably integrated into its genome) a nucleic acid molecule according to the 
invention linked to regulatory elements which allow expression of the nucleic acid 
molecule in plant cells and wherein the nucleic acid molecule is foreign to the transgenic 
plant cell. For the meaning of foreign; see supra. The presence and expression of the 
nucleic acid molecule in ttie transgenic plant cells leads to the synthesis of a cell cycle 
interacting protein and lends to physiological and phenotypic changes in plants containing 
such cells. 

Thus, the present invention also relates to transgenic plants and plant tissue comprising 
transgenic plant cells according to the invention. Due to the (over)expression of a cell 
cycle interacting protein of the invention, e.g., at developmental stages and/or in plant 
tissue in which they do not naturally occur these transgenic plants may show various 
physiological, developmental and/or morphological modifications in comparison to wild- 
type plants. 

Therefore, part of this invention is the use of cell cycle genes and/or cell cycle 
interacting proteins to modulate plant cell division and/or growth in plant cells, plant 
tissues, plant organs and/or whole plants. To the scope of the invention also belongs a 
method to influence the activity of cyclin-dependent protein kinase in a plant cell by 
transfonming the plant cell with a nucleic acid molecule according to the invention and/or 
manipulation of the expression of said molecule. More in particular using a nucleic acid 
molecule according to the invention, th disruption of plant cell division can be 
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accomplished by interfering in the expression of a substrate for cyclin-dependent protein 
kinase. The latter goal may be achieved, for example, with methods for reducing the 
amount of active cell cycle interacting proteins. 

For example, to obtain transgenic plants overexpressing the A. thaliana LDV115 gene 
(see Example 3), its coding region can be cloned, e.g., into the pAT7002 vector 
(Aoyama and Chua, Plant J. 11 (1997), 605-612). This vector allows inducible 
expression of the cloned inserts by the addition of the glucocorticoid dexamethasone. 
For example, following a polymerase chain reaction (PGR) technology the coding region 
of LDV115 can be amplified using appropriate primers, whereby a first primer contains 
an Xhof and a second primer contains an Spel restriction site. The obtained PGR 
fragment can be purified and cut with Xhol and Spel. Subsequently the fragment can be 
cloned into the Xhol and Spel sites of pTA7002. The resulted binary vector can be 
transferred into Agrobacterium tumefaciens. This strain can be used to transform 
Nicotiana tabacum cv. Petit havana using, e.g:, the leaf disk protocol (Horsh, Science 
227 (1985), 1229-1231) and Arabidopsis thaliana using, e.g., the root transformation 
protocol (Valvekens, PNAS 85 (1988). 5536-5540). Transgenic plants can then be 
selected on hygromycine 20 mg/l. Plants can be tested for LDV1 15 inducible expression 
as follows. 2 to 3 leaves of each transformant can be cut in two. Each half can be either 
submersed in 50 mM Na-citrate buffer (pH 5.8) with or without dexamethasone (0.03 
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using the Trizol reagents (Gibco-BRL) according to the manufactures and a northern gel 
cart b^ hjn usin^^^^^ 5 jig of RNArThe^^^ filter 
(HybondN+, Amersham) and hybridised with an LDV115 probe. Furthermore, seeds of 
transformants can be put on 34 MS medium with 1 % sucrose, both with and without 
dexamethasone. As a control SRI seeds should be included. In the presence of 
dexamethasone the growth behaviour of the transgenic plants as compared to the 
control plants is expected to be modified. For example, these transgenic plants may grow 
faster and/or have additional cells. Furthermore, said plant may be less sensitive to 
environmental stress compared to the corresponding wild type plant. 



Furthermore, the invention also relates to a transgenic plant cell which contains 
(preferably stably integrated into its genome) a nucleic acid molecule according to the 
invention or part thereof, wherein the transcription and/or expression of the nucleic acid 
molecule or part thereof leads to reduction of the synthesis of a cell cycle interacting 
protein. 

In a preferred embodiment, the reduction is achieved by an anti-sense, sense, ribozyme, 
co-suppression and/or dominant mutant effect. 

"Antisense" and "antisense nucleotides" means DNA or RNA constaicts which block the 
expression of the naturally occumng gene product 

The provision of the nucleic acid molecules according to the invention opens up the 
possibility to produce transgenic plant cells with a reduced level of the protein as 
described above and, thus, with a defect in cell division. Techniques how to achieve this 
are well known to the person skilled in the art. These include, for example, the 
expression of antisense-RNA. ribozymes, of molecules which combine antisense and 
ribozyme functions and/or of molecules which provide for a co-suppression effect; see 
also supra. When using the antisense approach for reduction of the amount of cell cycl 
interacting proteins in plant cells, the nucleic acid molecule encoding the antisense-RNA 
is preferably of homologous origin with respect to the plant species used for 
transformation. However, it is also possible to use nucleic acid molecules which display 
a high degree of homology to endogenously occuning nucleic acid molecules encoding 
a cell cycle interacting protein. In this case the homology is preferably higher than 80y , 
particulariy higher than 90% and still more preferably higher than 95%. The reduction of 
the synthesis of a protein according to the invention in the transgenic plant cells can 
result in an alteration in, e.g., cell division. In transgenic plants comprising such cells 
this can lead to various physiological, developmental and/or morphological changes, 
preferably to improved regeneration and transformation capacity of, e.g., cultured cells 
or wounded tissue. 

Thus, the present invention also relates to transgenic plants comprising the above- 
described transgenic plant cells. These may show, for example, a deficiency in cell 



division and/or reduced growth characteristics compared to wild type plants due to the 
stable or transient presence of a foreign DNA resulting in at least one of the following 
features: 

(a) disnjption of (an) endogenous gene(s) encoding a protein of the Jnvention; 

(b) expression of at least on antisense RNA and/or ribozyme^ against a transcript 
comprising a nucleic acid molecule of the Invention; 

(c) expression of a sense and/or non-translatable mRNA of the nucleic acid molecule 
of the invention; 

(d) expression of an antibody of the invention; 

(f) incorporation of a functional or non-functional copy of the regulatory sequence of 
the invention; or 

(g) incorporation of a recombinant DNA molecule or vector of the invention. 

The present invention also#relates to cultured plant tissues comprising transgenic plant 
cells as described above^whiehr either show overexpression of a protein according to the 
invention or a reduction in synthesis of sucfea proteinic 

Any transformed plant obtained according to the invention >can be used in a 
conventional breeding scheme or in in vitro plant propagation to produce more 
transformed plants with the same characteristics and/or can be used to introduce the 
same characteristic in other varieties of the same or related species. Such plants are 
also part of the invention. Seeds obtained from the transformed plants genetically also 
contain the same characteristic and are part of the invention. As mentioned before, the 
present invention is in principle applicable to any plant and crop that can be transformed 
with any of the transformation method known to those skilled in the art and includes for 
instance com, wheat, bariey, rice, oilseed crops, cotton, tree species, sugar beet, 
cassava, tomato, potato, numerous other vegetables, fruits. 

In yet another aspect, the invention also relates to harvestable parts and to propagation 
material of the transgenic plants according to the invention which either contain transgenic 
plant cells expressing a nucleic acid molecule according to the invention or which contain 
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cells which show a reduced level of the described protein. Harvestable parts can be In 
principle any useful parts of a plant, for example, flowers, pollen, seedlings, tubers, 
leaves, stems, fruit, seeds, roots etc. Propagation material includes, for example, seeds, 
fruits, cuttings, seedlings, tubers, rootstocks etc. 

Regulatory sequences of cell cycle interacting genes 

As mentioned above, the regulatory sequences that naturally drive the expression of the 
above described cell cycle interacting proteins may prove useful for the expression of 
heterologous DNA sequences in certain plant tissues and/or at different developmental 
stages in plant development 

Accordingly, in a further aspect the present invention relates to a regulatory sequence of 
a promoter naturally regulating the expression of a nucleic acid molecule of the 
invention described above or of a nucleic acid molecule homologous to a nucleic acid 
molecule of the invention. With methods well known in the art it is possible to isolate the 
regulatory sequences of the promoters that naturally regulate the expression of the 
above-described DNA sequences; see, e.g.. Example 4. For example, using the above 
described nucleic acid molecules as probes a genomic library consisting of plant 
genomic DNA cloned into phage or bacterial vectors can be screened by a person 
skilled in the art. Such a library consists e.g. of genomic DNA prepared from seedlings, 
fractionized in fragments ranging from 5 kb to 50 kb, cloned into the lambda GEM11 
(Promega) phages. Phages hybridizing with the probes can be purified. From the 
purified phages DNA can be extracted and sequenced. Having isolated the genomic 
sequences corresponding to the genes encoding the above-described cell cycle 
interacting proteins, it is possible to fuse heterologous DNA sequences to these 
promoters or their regulatory sequences via transcriptional or translational fusions well 
known to the person skilled in the art. In order to identify the regulatory sequences and 
specific elements of these cell cycle genes, 5'-upstream genomic fragments can be 
cloned in front of marker genes such as luc, gfp or the GUS coding region and the 
resulting chimeric genes can be introduced by means of Agrobacterium tumefaciens 
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mediated gene transfer into plants or transfected into plant cells or plant tissue for 
transient expression. The expression pattern observed in the transgenic plants or 
transfected plant cells containing the marker gene under the control of the regulatory 
sequences of the invention reveal the t>oundaries of the promoter and its regulatory 
sequences. Preferably, said regulatory sequence Is capable of confem'ng expression of 
a heterologous DNA sequence in main and lateral root meristems, shoot apical 
meristems, embryos at the globular, heart and torpedo stages, floral meristems and/or 
cambial cells in the stem. 

In context with the present invention, the term "regulatory sequence" refers to 
sequences which influence the specificity and/or level of expression, for example in the 
sense that they confer cell and/or tissue specificity; see supra. Such regions can be 
located upstream of the transcription initiation site, but can also be located downstream 
of it, e.g., in transcribed but not translated leader sequences. 

The term "promoter", within the meaning of the present invention refers to nucleotide 
sequences necessary for transcription initiation, i.e. RNA polymerase binding, and may 
also include, for example) the TATA box. 

The term "nucleic acid molecule homologous to a nucleic acid molecule of the 
invention", as used herein includes promoter regions and regulatory sequences of other 
cell cycle interacting protein encoding genes, such as genes from other species, for 
example, maize, alfalfa, potato, sorghum, millet, coix, bariey, wheat and rice the coding 
region of which share substantial homology to the cell cycle interacting proteins of the 
invention and which display substantially the same expression pattern. Such promoters 
are characterized by their capability of conferring expression of a heterologous DNA 
sequence in meristematic tissue and cells and other tissues mentioned above. 
Thus, according to the present invention, regulatory sequences from any species can be 
used that are functionally homologous to the regulatory sequences of the promoter of 
the above defined nucleic acid molecules, or promoters of genes that display an 
identical or similar pattern of expression, in the sense of being expressed in the above- 
mentioned tissues and cells. However, the expression confen^ed by the regulatory 
sequences of the invention may not be limited to. for example, root meristem cells but 



can include or be restricted to, for example, subdomalns of meristems. The particular 
expression pattern may also depend on the plantA/ector system employed. However, 
expression of heterologous DNA sequences driven by the regulatory sequences of th 
invention predominantly occurs in the meristem unless certain elements of the 
regulatory sequences of the invention, were taken and designed by the person skilled in 
the art to control the expression of a heterologous DNA sequence in other cell types. 

It is also immediately evident to the person skilled in the art that further regulatory 
elements may be added to the regulatory sequences of the invention. For example, 
transcriptional enhancers and/or sequences which allow for induced expression of the 
regulatory sequences of the invention may be employed. A surtable inducible system is 
for example tetracycline-regulated gene expression as described, e.g., by Gatz, supra. 

The regulatory sequence of the invention may preferably be derived from the above 
described cell cycle interacting genes. Plants that may be suitable sources for such 
genes have been described above. 

Usually, said regulatory sequence is part of a recombinant DNA molecule. In a preferred 
embodiment of the present invention, the regulatory sequence in the recombinant DNA 
molecule is operatively linked to a heterologous DNA sequence. 

The term heterologous with respect to the DNA sequence being operatively linked to the 
regulatory sequence of the invention means that said DNA sequence is not naturally 
linked to the regulatory sequence of the Invention. Expression of said heterologous DMA 
sequence comprises transcription of the DNA sequence, preferably into a translatable 
mRNA. Regulatory elements ensuring expression in eukaryotic cells, preferably plant 
cells, are well known to those skilled in the art. They usually comprise poly-A signals 
ensuring termination of transcription and stabilization of the transcript, see also supra- 
Additional regulatory elements may include transcriptional as well as translational 
enhancers; see supra. 
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In a preferred embodiment, the heterologous DNA sequence of the above-described 
recombinant DNA molecules encodes a peptide, protein, antisense RNA, sense RNA 
and/or ribozyme. The recombinant DNA molecule of the invention can be used alone or 
as part of a vector to express heterologous DNA sequences, which, e.g., encode 
proteins for, e.g., the control of disease resistance, modulation of nutrition value or 
diagnostics of cell cycle jelated gene expression. The recombinant DNA molecule or 
vector containing the DNA sequence encoding a protein of interest is introduced into the 
cells which in turn produce the RNA and optionally protein of interest For example, the 
regulatory sequences of the invention can be operatively linked to a lethal gene for use 
in the production of male and female sterility in plants. Suitable lethal genes include the 
Bacillus amyloliquefaciens ribonuclease (Hartlet, J. Mol. Biol. 89 (1985)) and the 
Bacillus amyloliquefaciens ribonuclease expressed with or without its inhibitor, barstar. 
Another example for a lethal gene is the catalytic A fragment of diphteria toxin 
(Tweeten, J. Bacteriol. 156 (1983), 680-685). Expression of diphteria toxin within yeast 
cells causes ADP-ribosylation of elongation factor 2, which leads to inhibition of protein 
synthesis and eventual cell death (Mattheakis, Mol. Cell. Biol. 12 (1992), 4026-4037). 

On the other hand, said protein can be a scorable marker, e.g., luciferase, green 
fluorescent protein or B-galactosidase. This embodiment is particulariy useful for simple 
and rapid screening methods for compounds and substances described herein below 
capable of modulating cell cycle interacting protein gene expression. For example, a cell 
suspension can be cultured in the presence and absence of a candidate compound in 
order to determine whether the compound affects the expression of genes which are 
under the control of regulatory sequences of the invention, which can be measured, 
e.g., by monitoring the expression of the above-mentioned marker. It Is also immediately 
evident to those skilled in the art that other marker genes may be employed as well, 
encoding, for example, a selectable marker which provides for the direct selection of 
compounds which induce or inhibit the expression of said marker. 

The regulatory sequences of the invention may also be used In methods of antisense 
approaches. The antisense RNA may be a short (generally at least 10, preferably at 
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least 14 nucleotides, and optionally up to 100 or more nucleotides) nucleotide sequence 
formulated to be conriplementary to a portion of a specific mRNA sequence and/or DNA 
sequence of the gene o\ interest. Standard methods relating to antisense technology 
have been described; see, e.g., Klann. Plant Physiol. 112 (1996), 1321-1330. Following 
transcription of the DNA sequence into antisense RNA, the antisense RNA binds to its 
target sequence within a cell, thereby inhibiting translation of the mRNA and down- 
regulating expression of the protein encoded by the mRNA. Thus, in a further 
embodiment, the invention relates to nucleic acid molecules of at least 15 nucleotides in 
length hybridizing specifically with a regulatory sequence as described above or with a 
complementary strand thereof. For the possible applications of such nucleic acid 
molecules, see supra. 

The present invention also relates to vectors, particulariy plasmids, cosmids, viruses 
and bacteriophages used conventionally in genetic engineering that comprise a 
recombinant DNA molecule of the invention. Preferably, said vector is an expression 
vector and/or a vector further comprising a selection marker for plants. For example of 
suitable selector markers, see supra. Methods which are well known to those skilled in 
the art can be used to construct recombinant vectors; see, for example, the techniques 
described in Sambrook, Molecular Cloning A Laboratory Manual, Cold Spring Harbor 
Laboratory (1989) N.Y. and Ausubel, Current Protocols in Molecular Biology, Green 
Publishing Associates and Wiley Interscience, N.Y. (1989). Alternatively, the 
recombinant DNA molecules and vectors of the invention can be reconstituted into 
liposomes for delivery to target cells; see also supra. 

The present invention furthermore relates to host cells transformed with a regulatory 
sequence, a DNA molecule or vector of the invention. Said host cell may be a 
prokaryotic or eukaryotic cell; see supra. 

In a further preferred embodiment, the present invention provides for a method for the 
production of transgenic plants, plant cells or plant tissue comprising the introduction of 
a nucleic acid molecule, recombinant DNA molecule or vector of the invention into the 





[DEBQ 



46 



genome of said plant, plant cell or plant tissue. For the expression of the heterologous 
DNA sequence under the control of the regulatory sequence according to the invention 
in plant cells, further regulatory sequences such as poly A tail may be fused, preferably 
3' to the heterologous DNA sequence, see also supra. Further possibilities might be to 
add Matrix Attachment Sites at the borders of the transgene to act as "delimiters" and 
insulate against methylation spread from nearby heterochromatic sequences. Methods 
for the introduction of foreign DNA into plants, plant cells and plant tissue are described 
above. 

Thus, the present invention relates also to transgenic plant cells which contain stably 
integrated into the genome a recombinant DNA molecule or vector according to the 
invention. 

Furthermore, the present invention also relates to transgenic plants and plant tissue 
comprising the above-described' transgenie^^plant cells.^ These plants may show, for 
example, modified architecture, increased yield or an increased tolerance to diseases, 
e.g., nematodes, geminivirusesr^ 

In yet another aspect the invention also relates to harvestable parts and to propagation 
material of the transgenic plants according to the invention which contain transgenic 
plant cells described above. Harvestable parts and propagation material can be in 

1^1 II aiijf vi«:>wiui ^ c« |^iaiifc, ^upia. 

W\th the regulatory sequerices of the invention, it wilf be possible to study 7/7 vivo gene 
expression related to cell cycle interacting proteins. Furthermore, since cell cycle 
interacting protein expression has different patterns in different stages of physiological 
and pathological conditions, it is now possible to determine further regulatory sequences 
which may be important for the up- or down-regulation of the expression or activity of 
cell cycle interacting proteins, for example in response to ions or elicitors. In addition, it 
is now possible to in vivo study mutations which affect different functional or regulatory 
aspects of specific gene expression in the cell cycle. Thus, the present invention also 



relates to the use of the above described regulatory sequences and recombinant DNA 
molecules of the invention for the expression of heterologous DNA sequences. 

The in vivo studies referred to above will be suitable to further broaden the knowledge 
on the mechanisms and genes involved in the control of the cell cycle. Expression of 
heterologous genes or antisense RNA under the control of the regulatory sequence of 
the present invention in plants and plant cells may allow the understanding of the 
function of each of these genes in the plant. 

As mentioned hereinbefore, the nucleic acid molecules and proteins of the present 
invention provide a basis for the development of mimetic compounds that may be 
inhibitors or activators of cell cycle interacting proteins or their encoding genes. It will be 
appreciated that the present invention also provides cell based screening methods that 
allow a high-throughput-screening (HTS) of compounds that may be candidates for such 
inhibitors and activators. 

Thus, the present invention further relates to a method for the identification of an 
activator or inhibitor of genes encoding cell cycle interacting proteins comprising the 
steps of: 

(a) culturing a plant cell or tissue or maintaining a plant comprising a recombinant 
DNA molecule comprising a readout system operatively linked to a regulatory 
sequence of the invention in the presence of a compound or a sample comprising 
a plurality of compounds under conditions which permit expression of said 
readout system; 

(b) identifying or verifying a sample and compound, respectively, which leads to 
suppression or activation and/or enhancement of expression of said readout 
system in said plant, plant cell, or plant tissue. 

The present invention further relates to a method for identifying and obtaining an 
activator or inhibitor of cell cycle interacting proteins comprising the steps of: 
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(a) combining a compound to be screened with a reaction mixture containing the cell 
cycle interacting protein of the invention and a readout system capable of 
interacting with the cell cycle interacting protein under suitable conditions which 
permit interaction of the cell cycle interacting protein with said readout system; 

(b) identifying or verifying a sample and compound, respectively, which leads to 
suppression or activation of the readout system. 

The term "read out system" in context with the present invention means any substrate 
that can be monitored, for example due to enzymatically induced changes. It also 
includes DNA sequences which upon transcription and/or expression in a cell, tissue or 
organism provide for a scorable and/or selectable phenotype. Such read out systems 
are well known to those skilled in the art and comprise, for example, substrates for 
protein kinases, recombinant DNA molecules and marker genes as described above. 

The term "plurality of compounds" in a method of the invention is to be understood as a 
plurality of substances which may or may not be identical: 

Said compound or plurality of compounds may be chemically synthesized or 
microbiologically produced and/or comprised in, for example, samples, e.g., cell extracts 
from, e.g., plants, animals or microorganisms. Furthermore, said compound(s) may be 
known in the art but hitherto not known to be capable of suppressing or activating cell 
cycle interacting proteins. The reaction mixture may be a cell free extract or may 
cqmprise a cell or tissue^^c^^^^^ set ups for the method of the invention are 

known to the person skilled in the art and are. for example, generally described in 
Alberts et al., Molecular Biology of the Cell, third edition (1994), in particular Chapter 17. 
The plurality of compounds may be, e.g., added to the reaction mixture, culture medium, 
injected into the cell or sprayed onto the plant. The cell or tissue that may be employed 
in the method of the invention preferably is a host ceil, plant cell or plant tissue of the 
invention described in the embodiments hereinbefore. 

If a sample containing a compound or a plurality of compounds is identified in the 
method of the invention, then it is either possible to isolate the compound from the 
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original sample identified as cx>ntaining the compound capable of suppressing or 
activating , cell cycle .interacting proteins, or one. can further subdivide the original 
sample, for example, if it consists of a plurality of different compounds, so as to reduce 
the number of different substances per sample and repeat the method with the 
subdivisions of the original sample. Depending on the complexity of the samples, the 
steps described above can be performed several times, preferably until the sample 
identified according to the method of the invention only comprises a limited number of or 
only one substance(s). Preferably said sample comprises substances of similar 
chemical and/or physical properties, and most preferably said substances are identical. 
Several methods are known to the person skilled in the art for producing and screening 
large libraries to identify compounds having specific affinity for a target. These methods 
include the phage-display method in which randomized peptides are displayed from 
phage and screened by affinity chromatography to an immobilized receptor, see, e.g., 
WO 91/17271, WO 92/01047, US-A-5.223,409. In another approach, combinatorial 
libraries of polymers immobilized on a chip are synthesized using photolithography; see, 
e.g.. US-A-5,143,854. WO 90/15070 and WO 92/10092. The immobilized polymers are 
contacted with a labeled receptor and scanned for label to identify polymers binding to 
the receptor. The synthesis and screening of peptide libraries on continuous cellulose 
membrane supports that can be used for identifying binding ligands of the polypeptide 
of the invention and thus possible inhibitors and activators is described, for example, in 
Kramer. Methods Mol. Biol. 87 (1998). 25-39, This method can also be used, for 
example, for determining the binding sites and the recognition motifs in the polypeptide 
of the invention. In like manner, the substrate specificity of the DnaK chaperon was 
determined and the contact sites between human interieukin-6 and its receptor; see 
Rudiger, EMBO J. 16 (1997). 1501-1507 and Weiergraber. FEBS Lett. 379 (1996). 122- 
126. respectively. Furthermore, the above-mentioned methods can be used for the 
construction of binding supertopes derived from the polypeptide of the invention. A 
similar approach was successfully described for peptide antigens of the anti-p24 (HIV-1) 
monoclonal antibody; see Kramer. Cell 91 (1997), 799-809. A general route to 
fingerprint analyses of peptide-antibody interactions using the clustered amino acid 
peptide library was described in Kramer, Mol. Immunol, 32 (1995), 459-465. In addition. 



antagonists of the polypeptide of the invention can be derived and identified from 
monoclonal antibodies that specifically react with the polypeptide of the invention In 
accordance with the methods as described in Doring, Mol. Immunol. 31 (1994), 1059- 
1067, 

More recently, WO 98/25146 described further methods for screening libraries of 
complexes for compounds having a desired property, especially,' the capacity to 
agonize, bind to, or antagonize a polypeptide or its cellular receptor. The complexes in 
such libraries comprise a compound under test, a tag recording at least one step in 
synthesis of the compound, and a tether susceptible to modification by a reporter 
molecule. Modification of the tether is used to signify that a complex contains a 
compound having a desired property. The tag can be decoded to reveal at least one 
step in the synthesis of such a compound. Other methods for identifying compounds 
which interact with the oroteins according to the invention or nucleic acid molecules 
encoding such molecules>^are,-^for^example,4he^i!lMt!2 screening^^^ phage display 
system as well as filter^^bindiogs^^ssays or "real .timeSimeasuring^gf interaction using, for 
example, the B I Acore apparatus (Ph^nmaeia)^ 

All these methods can^^be used* in accordance withMhe-^^present^invention to identify 
activators and antagonists ©f the^polypeptide^ofthe inventioni?** 

Various sources for the basic structure of such an activator or inhibitor can be employed 
and comprise, for example, mimetic analogs of the polypeptide of the invention. Mimetic 
analogs of the polypeptide of the invention or biologically active fragments thereof can 
be generated by, for example, substituting the amino acids that are expected to be 
essential for the biological activity with, e.g., stereoisomers, i.e. D-amino acids; see e.g.. 
Tsukida, J. Med, Chem. 40 (1997), 3534-3541. Furthenmore, in case fragments are 
used for the design of biologically active analogs Pro-mimetic components can be 
incorporated into a peptide to reestablish at least some of the conformational properties 
that may have been lost upon removal of part of the original .polypeptide; see, e.g., 
Nachman, Regul. Pept. 57 (1995), 359-370. Furthermore, the polypeptide of the 
invention can be used to identify synthetic chemical peptide mimetics that bind to or can 
function as a ligand, substrate, binding partner or the receptor of the polypeptide of the 




invention as effectively as does the natural polypeptide; see, e.g., Engleman, J. Clin. 
Invest. 99 (1 997). 2284-2292. 

The stmcture-based design and synthesis of low-molecular-weight synthetic nnolecules 
that nnimic the activity of the native biological polypeptide is further described in, e.g., 
Dowd. Nature Biotechnol. 16 (1998), 190-195; Kieber-Ennnnons, Current Opinion 
Biotechnol. 8 (1997). 435-441; Moore, Proc. West PharmacoL Soc. 40 (1997), 115-119; 
Mathews. Proc. West PhamnacoL Soc. 40 (1997), 121-125; Mukhija. European J. 
Blochem. 254 (1 998), 433-438. 

It is also well known to the person skilled in the art, that it is possible to design, 
synthesize and evaluate mimetics of snnall organic conDpounds that, for example, can 
act as a substrate or ligand to the polypeptide of the invention. For example. It has been 
described that D-glucose mimetics of hapalosin exhibited similar efficiency as hapalosin 
in antagonizing multidrug resistance assistance-associated protein in cytotoxicity; see 
Dinh, J. Med. Chem. 41 (1998), 981-987. 

The nucleic acid molecule of the invention can also serve as a target for activators and 
inhibitors. Activators may comprise, for example, proteins that bind to the mRNA of a 
gene encoding a polypeptide of the invention, thereby stabilizing the native 
conformation of the mRNA and facilitating transcription and/or translation, e.g., in like 
manner as Tat protein acts on HIV-RNA, Furthermore, methods are described in the 
literature for identifying nucleic acid molecules such as an RNA fragment that mimics 
the structure of a defined or undefined target RNA molecule to which a compound binds 
inside of a cell resulting in retardation of cell growth or cell death; see, e.g., WO 
98/18947 and references cited therein. These nucleic acid molecules can be used for 
identifying unknown compounds of pharmaceutical and/or agricultural interest, and for 
identifying unknown RNA targets for use in treating a disease. These methods and 
compositions can be used in screening for novel antibiotics, bacteriostatics, or 
modifications thereof or for identifying compounds useful to alter expression levels of 
proteins encoded by a nucleic acid molecule. Alternatively, for example, the 
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conformational stnjcture of the RNA fragment which mimics the binding site can be 
employed in rational drug design to modify known antibiotics to make them bind more 
avidly to the target. One such-methodology is nuclear magnetic resonance (NMR), 
which is useful to identify drug and RNA confomnational structures. Still other methods 
are, for example, the drug design methods as described in WO 95/35367. US-A- 
5,322,933, where the crystal staicture of the RNA fragment can be deduced and 
computer programs are utilized to design novel binding compounds which can act as 
antibiotics. 

The compounds which can be tested and identified according to a method of the 
invention may be expression libraries, e.g., cDNA expression libraries, peptides, 
proteins, nucleic acids, antibodies, small organic compounds, hormones, 
peptidomimetics, PNAs or the like (Milner, Nature Medicine 1 (1995). 879-880; Hupp. 
Cell 83 (1995), 237-245; Gibbs, Cell 79 (1 994), 193-198 and references cited supra). 
Furthenmore. genes encoding a putative regulator of cell cycle interacting protein and/or 
which excert their effects up- or downstream the cell cycle interacting protein of th 
invention may be identified using, for example, insertion mutagenesis using, for 
example, gene targeting vectors known in the art (see, e.g., Hayashi, Science 258 
(1992), 1350-1353; Fritze and Walden, Gene activation by T-DNA tagging. In Methods 
in Molecular biology 44 (Gartland. K.M.A. and Davey, M.R., eds). Totowa: Human Press 
ri995^. 281-294^ or transDOson taaaina fChandlee. Physiologia Plantarum 78 (1990), 
105-115). Said compounds can also be functional derivatives or analogues of known 
inhibitors or activators. Suchruseful compounds can be for example transacting factors 
which bind to the cell cycle interacting protein or regulatory sequences of the invention. 
Identification of transacting factors can be earned out using standard methods in the art 
(see, e.g., Sambrook. supra, and Ausubel, supra). To detemiine whether a protein binds 
to the protein or regulatory sequence of the invention, standard native gel-shift analyses 
can be carried out. In order to identify a transacting factor which binds to the protein or 
regulatory sequence of the invention, the protein or regulatory sequence of the invention 
can be used as an affinity reagent in standard protein purification methods, or as a 
probe for screening an expression library. The identification of nucleic acid molecules 



which encode proteins which interact with the cell cycle interacting proteins described 
above can also be achieved, for example, as described in Scofield (Science 274 (1996). 
2063-2065) by use of the so-called yeast "two-hybrid system"; see also the appended 
examples. In this system the protein encoded by the nucleic acid molecules according to 
the invention or a smaller part thereof is linked to the DNA-binding domain of the GAL4 
transcription factor. A yeast strain expressing this fusion protein and comprising a lacZ 
reporter gene driven by an appropriate promoter, which is recognized by the GAL4 
transcription factor, is transformed with a library of cDNAs which will express plant 
proteins or peptides thereof fused to an activation domain. Thus, if a peptide encoded by 
one of the cDNAs is able to interact with the fusion peptide comprising a peptide of a 
protein of the Invention, the complex is able to direct expression of the reporter gene. In 
this way the nucleic acid molecules according to the invention and the encoded peptide 
can be used to identify peptides and proteins interacting with cell cycle interacting 
proteins. It is apparent to the person skilled in the art that this and similar systems may 
then further be exploited for the identification of inhibitors of the binding of the interacting 
proteins. 

Once the transacting factor is identified, modulation of its binding to or regulation of 
expression of the cell cycle interacting protein of the invention can be pursued, 
beginning with, for example, screening for inhibitors against the binding of the 
transacting factor to the protein of the present invention. Activation or repression of cell 
cycle interacting proteins could then be achieved in plants by applying of the transacting 
factor (or its inhibitor) or the gene encoding it, e.g. in a vector for transgenic plants. In 
— - : addition, if the active form- of the^transacting- factor is a dimer, dominant-negative 
mutants of the transacting factor could be made in order to inhibit its activity. 
Furthermore, upon identification of the transacting factor, further components in the 
pathway leading to activation (e.g. signal transduction) or repression of a gene involved 
in the control of cell cyce then can be identified. Modulation of the activities of these 
components can then be pursued, in order to develop additional drugs and methods for 
modulating the cell cycle in animals and plants. 
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Thus, the present invention also relates to the use of the two-hybrid system as defined 
above for the identification of cell cycle interacting proteins or activators or inhibitors of 
such poteins 



Determining whether a compound is capable of suppressing or activating cell cycle 
interacting proteins can be done, for example, by monitoring DNA duplication and cell 
division. It can further be done by monitoring the phenotypic characteristics of the cell of 
the invention contacted with the compounds and compare it to that of wild-type plants. 
In an additional embodiment, said characteristics may be compared to that of a cell 
contacted with a compound which is either known to be capable or incapable of 
suppressing or activating cell cycle interacting proteins. 



The compounds isolated by the above methods also sen/e as lead compounds for the 
development of analog compounds. The analogs should have a stabilized electronic 
configuration and molecular confomriation that allows key functional groups to be 
presented to the receptor in substantially the same way as the lead compound. In 
particular, the analog compounds have spatial electronic properties which ar 
comparable to the binding region, but can be smaller molecules than the lead 
compound, frequently ha^/ing a molecular weight below about 2 kD and preferably below 
about 1 kD. Identification of analog compounds can be performed through use of 
techniques such as self-consistent field (SCF) analysis, configuration interaction (C!) 
analysis, and normal mode dynamics analysis. Computer programs for implementing 
these techniques are available; e.g.. Rein, Computer-Assisted Modeling of Receptor- 
Ligand Interactions (Alan Liss, New Yori^, 1989). Methods for the preparation of 
chemical derivatives and analogues are well known to those skilled in the art and are 
described in, for example, Beilstein, Handbook of Organic Chemistry, Springer edition 
New York Inc.. 175 Fifth Avenue, New York, N.Y. 10010 U.S.A. and Organic Synthesis, 
Wiley, New York, USA, Furthermore, said derivatives and analogues can be tested for 
their effects according to methods known in the art. Furthenmore, peptidomimetics 
and/or computer aided design of appropriate derivatives and analogues can be used, 
for example, according to the methods described above. 
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The inhibitor or activator identified by the above-described method may prove useful as 
a herbicide, pesticide, insecticide, antibiotic, tumor suppressing agent and/or as a cell 
growth regulator. Thus, in a further embodiment the invention relates to a compound 
obtained or identified according to the method of the invention said compound being an 
activator of cell cycle interacting proteins or an inhibitor of cell cycle interacting proteins. 
The above-described compounds include, for example, cell cycle kinase inhibitors. 
"Cell-cycle kinase inhibitor" (CKI) is a protein which inhibit CDK/cyclin activity and is 
produced and/or activated when further cell division has to be temporarily or 
continuously prevented. The antibodies, nucleic acid molecules, inhibitors and activators 
of the present invention preferably have a specificity at least substantially identical to the 
binding specificity of the natural ligand or binding partner of the cell cycle protein of the 
invention. In particular if cell cycle stimulation is desired. An antibody or inhibitor can 
have a binding affinity to the cell cycle interacting protein of the invention of at least 
10^M \ preferably higher than 10^M'^ and advantageously up to 10^°M'^ in case cell cycle 
suppression should be mediated. 

In a preferred embodiment, a suppressive antibody or inhibitor of the invention has an 
affinity of at least about 10'^ M, preferably at least about 10'® M and most preferably at 
least about 10"^'' M; and cell cycle stimulating activator has an affinity of less than about 
10"'' M, preferably less than about W^M and most preferably in order of 10"®M. 
In case of nucleic acid molecules it is preferred that they have a binding affinity to those 
encoding the amino acid sequences depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12 or 14 of 
at most 2-, 5- or 10-fold less than an exact complement of 20 consecutive nucleotides of 
the above described nucleic acid molecules. 

Preferably, the compound identified according to the above described method or its 
analog or derivative is further formulated in a therapeutically active form or in a form 
suitable for the application in plant breeding or plant cell and tissue culture. For 
example, it can be combined with a agriculturally acceptable cam'er known in the art. 
Thus, the present invention also relates to a method of producing a therapeutic or plant 
effective composition comprising the steps of one of the above described methods of 
the invention and combining the compound obtained or identified in the method of the 
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invention or an analog or derivative thereof with a pharmaceutically acceptable carrier or 
with a plant cell and tissue culture acceptable carrier. As is evident from the above, the 
present invention generally relates to compositions comprising >at -least one of the 
aforementioned nucleic acid molecules, vectors, proteins, regulatory sequences, 
recombinant DNA molecules, antibodies or compounds. Advantageously, said 
composition is for use as a medicament, a diagnostic means, a kit or as a plant effectiv 
composition. 

Compositions useful in agriculture and in plant cell and tissue culture 

Plant protection compositions can be prepared by employing the above-described 
methods of the invention and synthesizing the compound identified as inhibitor or 
activator in an amount sufficient for use in agriculture. Thus, the present invention also 
relates to a method for the preparation of an agricultural plant^protection composition 
comprising the above-described steps of the method of the invention and synthesizing 
the compound so identified or an analog or derivative thereof*^ 

In the plant protection composition of the invention, the compound identified by the 
above-described method may be preferentially-^^forrnulated by conventi means 
commonly used for the application of, for example, herbicides and pesticides or agents 
capable of inducing systemic acquired resistance (SAR). For example, certain additives 
known to those skilled in the art stabilizers or substances which facilitate the uptake by 
the plant cell, plant tissue or plant may be used. 

Pharmaceutical compositions 

The cell cycle interacting proteins of the invention appear to function in the cell division 
cycle which is similar in plants and animals. Accordingly, the nucleic acid molecules and 
proteins of the invention or derivatives thereof as well as the above described activators 
and inhibitors may be used to modulate the cell division cycle in animal, preferably 
mammalian cells which is integral to the development and spread of cancerous cells. A 
cell cycle interacting protein that acts as a basal transcription factor may promote 




cancer cell growth. In conditions where cell cycle interacting protein activity is not 
desirable, cells could be transfected with antisense sequences to cell cycle interacting 
protein encoding polynucleotides or provided with antagonists to the protein or its 
encoding gene. Thus, the above described antagonists or antisense molecules may be 
used to slow. stop, or reverse cancer cell growth. Thus, the present invention also 
relates to a method of producing a therapeutic agent comprising the steps of the 
methods described hereinbefore and synthesizing the activator or inhibitor obtained or 
identified in step (c) or ai. analog or derivative thereof in an amount sufficient to provide 
said agent in a therapeutically effective amount to a patient. 

Compounds identified by the above methods or analogs are fomnulated for therapeutic 
use as pharmaceutical compositions. The compositions can also Include, depending on 
the fomiulation desired, pharmaceufically acceptable, usually sterile, non-toxic carriers 
or diluents, which are defined as vehicles commonly used to fomiulate pharmaceutical 
compositions for animal or human administration. The diluent is selected so as not to 
affect the biological activity of the combination. Examples of such diluents are distilled 
water, physiological saline. Ringer's solutions, dextrose solution, and Hank^s solution. In 
addition, the pharmaceutical composition or formulation may also include other carriers, 
adjuvants, or nontoxic, nontherapeutic, nonimmunogenic stabilizers and the like, A 
therapeutically effective dose refers to that amount of protein or its antibodies, 
antagonists, or inhibitors which ameliorate the symptoms or condition. Therapeutic 
efficacy and toxicity of such compounds can be detenmined by standard pharmaceutical 
procedures in cell cultures or experimental animals, e.g., ED50 (the dose therapeutically 
effective in 50% of the population) and LD50 (the dose lethal to 50% of the population). 
The dose ratio between therapeutic and toxic effects is the therapeutic index, and it can 
be expressed as the ratio. LD50/ED50. 

Diagnostic means and kits 

The invention also relates to a diagnostic composition comprising at least one of the 
aforementioned nucleic acid molecules, vectors, proteins, antibodies or compounds and 
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optionally suitable means for detection. Said diagnostic compositions may be used for 
methods for detemnining expression of cell cycle interacting proteins by detecting the 
presence of the con-espondihg mRNA which comprises isolation of mRNA from a cell 
and contacting the mRNA so obtained with a probe comprising, a nucleic acid probe as 
described above under hybridizing conditions, detecting the presence of mRNA 
hybridized to the probe, and thereby detecting the expression of the protein in the cell. 
Further methods of detecting the presence of a protein according to the present 
invention comprises immunotechniques well known in the art, for example enzyme 
linked immunosorbent assay. Furthermore, it Is possible to use the nucleic acid 
molecules according to the invention as molecular maricers in plant breeding. Moreover, 
the present invention relates to a kit comprising at least one of the aforementioned 
nucleic acid molecules, regulatory sequences, recombinant DNA molecules, vectors, 
proteins, compounds or antibodies of the invention. The kit of the invention may contain 
further ingredients such as selection markers and components^^fofc selective media 
suitable for the generation of transfamfied host cells«and'«tramsgenic*plant cells, plant 
tissue or plants. Furthermore! the kit may* include buffers and^ substrates for reporter 
genes that may be present in the recombinant gene or vector of the invention. The kit of 
the invention may advantageously be used for carrying out the method of the invention 
and could be, inter alia, employed in a variety of applications referred to herein, e.g., in 
the diagnostic field or as research tool. The parts of the kit of the invention can be 
packaged individually in vials or in combination in containere or multicontainer units. 
Manufacture of the kit follows preferably standard procedures which are known to the 
person skilled in the art. The kit or its ingredients according to the inventlbri can be used 
in plant cell and plant tissue cultures, for example, for any of the above described 
methods for detecting inhibitors and activators of cell cycle genes. The kit of the 
invention and its ingredients are expected to be very useful in breeding new varieties of. 
for example, 'plants whichtdisplay improved properties«such. as nutiitial value or disease 
resistance. 



Further applications of the Invention 



The person skilled in the art can use proteins according to the invention from other 
organisms such as yeast and animals to influence cell division progression in those 
other organisms such as mammals or insects. In a prefen-ed embodiment one or more 
DNA sequences, vectors or proteins of the invention or the above-described antibody or 
compound are, for instance, used to specifically interfere in the dismption of the 
expression levels of genes involved in G1/S transition in the cell cycle process In 
transformed plants, particularly : 

• in the complete plant 

• in selected plant organs, tissues or cell types 

• under specific environmental conditions, including abiotic stress such as cold, 
heat, drought or salt stress or biotic stress such as pathogen attack 

• during specific developmental stages. 

Specifically the plant cell division rate and/or the inhibition of a plant cell dWision can be 
influenced by (partial) e'imination of a gene or reducing the. expression of a gene 
encoding a protein according to the invention. Said plant cell division rate and/or the 
inhibition of a plant cell division can also be influenced by eliminating or inhibiting the 
activity of the protein according to the invention by using for instance antibodies directed 
against said protein. As a result of said elimination or reducHon greater organisms or 
specific organs or tissues can be obtained; greater in volume and in mass too. 
Furthermore inhibition of cell division by various adverse environmental conditions such 
as drought, high salt content, chilling and the like can be delayed or prevented by 
reduction or enhancing (e.g. with a dominant negative version) of said expression of a 
gene according to the invention. The division rate of a plant cell can also be influenced 
in a transformed plant by overexpression of a nucleic acid molecule according to the 
invention. Therefore an important aspect of the cunent Invention is a method to modify 
plant architecture by overproduction or reduction of expression of a sequence according 
to the invention under the control of a tissue, cell or organ specific promoter. Another 
aspect of the present invention is a method to modify the growrth inhibition of plants 



caused by environmental stress conditions above mentioned by appropriate use of 
sequences according to the invention. Surprisingly using a polypeptide or fragment 
thereof according to the invention or using antisense RNA or any method to reduce the 
expression of the gene according to the invention, cell division in the meristem of both 
main and lateral roots, shoot apical or the vascular tissue of a plant can be manipulated. 
Furthermore any of the DNA sequences of the invention can be used to manipulat 
(reduce or enhance) the level of endopolyploidy and thereby increasing the storage 
capacity of, for example, endosperm cells. Thus, another aspect of the current invention 
is that one or more DMA sequences, vectors or proteins, regulatory sequences or 
recombinant DNA molecules of the invention or the above-described antibody or 
compound can be used to modulate, for instance, endoreduplication in storage cells, 
storage tissues and/or storage organs of plants or parts thereof. The term 
"endoreduplication" means recurrent DNA replication without consequent mitosis and 
cytokinesis. 

Preferred target storage organs and parts thereof for the modulation of 
endoreduplication are, for instance, seeds (such as from cereals, oilseed crops), roots 
(such as in sugar beet), tubers (such as in potato) and fruits (such as in vegetables and 
fruit species). Furthermore it is expected that increased endoreduplication in storage 
organs and parts thereof correlates with enhanced storage capacity and as such with 
improved yield. In yet another embodiment of the invention, a plant with modulated 
endoreduplication in the whole plant or parts thereof can be obtained from a single plant 
cell by transforming the cell, in a manner known to the skilled person, with the above- 
described means; 

In view of the foregoing, the present invention also relates to the use of a DNA 
sequence, vector, protein, antibody, regulatory sequences, recombinant DNA molecule, 
nucleic acid molecules or compound of the invention for modulating plant cell cycl , 
plant cell division and/or growth, for Influencing the activity of cell cycle interacting 
protein, for disrupting plant cell division by influencing the presence or absence or by 
interfering in the expression of a cyclin-dependent protein, for modifying growth 
inhibition of plants caused by environmental stress conditions, for inducing male or 
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female sterility, for influencing cell division progression in a host as defined above or for 
use in a screening method for the identification of inhibitors or activators of cell cycle 
proteins. 

Furthermore, it is possible to use the nucleic acid molecules according to the invention 
as molecular markers in plant breeding. Thus, the present invention also relates to the 
use of a DNA sequence or regulatory sequence of the invention as a marker gene in 
plant or animal cell and tissue culture or as a marker in marker-assisted plant breeding. 
Moreover, the overexpression of nucleic acid molecules according to the invention may 
be useful for the alteration or modification of plant/pathogene interaction. The term 
"pathogene" includes, for example, bacteria, viruses and fungi as well as protozoa. 

These and other embodiments are disclosed and encompassed by the description and 
examples of the present invention. Further literature conceming any one of the 
methods, uses and compounds to be employed in accordance with the present 
invention may be retrieved from public libraries, using for example electronic devices. 
For example the public database "Medline" may be utilized which is available on the 
Internet, for example under http://www.ncbi.nIm,nih.gov/PubMed/medIine.html. Further 
databases and addresses, such as http://www.ncbi.nlm.nih.gov/, 
http://wwwjnfobiogen.fr/, http://www.fmi.ch/biology/research_tools.htmI, http://www.tigr. 
org/, are known to the person skilled in the art and can also be obtained using, e.g., 
http://www.lycos.com. An overview of patent information in biotechnology and a survey 
of relevant sources of patent information useful for retrospective searching and for 
cun-ent awareness is given in Berths, TIBTECH 12 (1994), 352-364. 

The present invention ir. further illustrated by reference to the following non-limiting 
examples. 

Unless stated otherwise in the examples, all recombinant DNA techniques are 
performed according to protocols as described in Sambrook et al. (1989), Molecular 
Cloning : A Laboratory Manual. Cold Spring Harbor Laboratory Press, NY or in Volumes 
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1 and 2 of Ausubel et al. (1994), Current Protocols in Molecular Biology, Current 
Protocols. Standard materials and methods for plant molecular work are described in 
Plant Molecular Biology Labfase (1993) by R.D.D. Croy, jointly published by BIOS 
Scientific Publications Ltd (UK) and Blackwell Scientific Publications (UK). 



Example 1: Identification of cell cycle interacting proteins using the two hybrid 
system with CDC2b as a bait 

A two-hybrid screening was performed using as bait a fusion between the GAL4 DNA- 
binding domain and CDC2bAt Vectors and strains used were provided with the 
Matchmaker Two-Hybrid System (Clontech, Palo Alto. CA). The bait was constructed by 
inserting the CDC2bAt PCR fragment into the pGBT9 vector. The PGR fragment was 
created from the cDNA using primers to incorporate EcoRI restriction enzyme sites (5*- 
CGGATCCGAATTCATGGAGAACGAG-3' (SEQ ID NO: 15) and 5'- 
CGGATCCGAATTCTCAGAACTGAGA-S') (SEQ ID NO: 16). The>,PCR fragment was 
cut with EcoRI and cloned into the EcoRI site of pGBT9, resulting in the plasmid 
pGBTCDC2B. The GAL4 activation domain cDNA fusion library was obtained from 
Clontech from mRNA of Arabidopsis thaliana cell suspensions harvested at various 
growing stages: eariy exponential, exponential, eariy stationary, and stationary phase. 
For the screening a 1 -liter culture of the Saccharomyces cerevisiae strain HF7c {MATa 
ura3-52 his3-200 ade2-101 Iys2-801 trp1-901 Ieu2-3.112 gal4-542 gal80-538 
LYS2::GAL1^j;^-GAL1j„f,-HIS3 URA3::GAL4,rr^r^yClTATA-LacZi was cotransformed 
with 590 pg pGBTCDC2B, 1100 pg DNA of the library, and 40 mg salmon sperm earner 
DNA using the lithium acetate method (Gietz ef al.. 1992). To estimate the number of 
independent cotransformants. 1/1000 of the transfomriation mix was plated on Leu- and 
Trp- medium. The rest of the transformation mix was plated on medium to select for 
histidine prototrophy (Trp-, Leu-. His-). After 5 days of growth at 30°C, the colonies 
larger than 2 mm were streaked on histidine-lacking medium. A total of 10^ independent 
cotransformants were screened for there ability to grow on histidine free medium. A 5- 
day incubation at SOX yielded 352 colonies. Of the His* colonies the activation domain 





plasmids were isolated as described (Hoffman and Winston, 1987, Gene 57, 267-272). 
The hybriZAP™ inserts were PCR amplified using the primers 5 - 
AGGGATGTTTAATACCACTAC-3' (SEQ ID NO: 17) and 5'- 
GCACAGTTGAAGTGAACTTGC-3' (SEQ ID NO: 18). PCR fragments were digested 
with Alu\ and fractionized on a 2% agarose gel. Plasmid DNA of which the inserts gave 
rise to different restriction patterns were electroporated into Escherichia coli XL1-Blue, 
and the DNA sequence of the inserts was determined. Extracted DNA was also used to 
retransform HF7c to test the specificity of the interaction. 

Example 2: Identification of cell cycle interacting proteins using the two hybrid 
system with CDC2a as a bait 

For the identification of cell cycle interacting proteins also a two hybrid system based on 
GAL4 recognition sites to regulate the expression of both his3 and lacZ reporter genes 
was used to identify CDC2aAt-interactlng of proteins. The bait used for the two-hybrid 
screening was constmcted by inserting the CDC2aAt coding region into the pGBT9 vector 
(Clontech). The insert was created by PCR using the CDC2aAt cDNA as template. 
Primers were designed to incorporate EcoRI restriction enzyme sites. The primers used 
were 5'.CGAGATCTGAATTCATGGATCAGTA-3' (SEQ ID NO: 19) and 
5'-CGAGATCTGAATTCCTAAGGCATGCC-3' (SEQ ID NO: 20). The PCR fragment was 
cut with EcoRl and cloned into the EcoRI site of pGBT9, resulting in the pGBTCDC2A 
plasmid. For the screening a GAL4 activation domain cDNA fusion library was used 
constructed from Arabidopsis thaliana cell suspension cultures. This library was 
constructed using RNA isolated from cells harvested at 20 hours, 3, 7 and 10 days after 
dilution of the culture in new medium. These time point correspondent to cells from the 
eariy exponential growth phase to the late stationary phase. mRNA was prepared using 
Dynabeads oligo(dT)25 according to the manufacturer's instructions (Dynal). The GAL4 
activation domain cDNA fusion library was generated using the HybriZAP™ vector 
purchased with the HybriZAP™ Two-Hybird cDNA Gigapack cloning Kit (Stratagene) 
following the manufacturer's instmctions. The resulting library contained approximately 
3.10® independ nt plaque-forming units, with an average insert size of 1 Kb. 
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For the screening a 1 -liter culture of the Saccharomyces cerevisiae strain HF7c (AMra 
ura3-52 his3-200 ade2-101 Iys2-801 trp1-901 Ieu2-3.112 gal4-542 gal80-538 
LYS2::GAL1^-GAL1j„^-HIS3 URA3::GAL4„^^^-CyC1^^j^-Lac2) was cotransfomied 
with 400 HQ pGBTCDC2A, 500 fig DNA of the library, and 40 mg salnnon sperm carrier 
DNA using the lithium acetate method (Gietz et a/. 1992, Nucleic Acids Res. 20, 1425). To 
estimate the number of independent cotransfonnants, 1/1000 of the transfomiation mix 
was plated on Leu" and Trp" medium. The rest of the transformation mix was plated on 
medium to select for histidine prototrophy (Trp", Leu", His"). Of a total of approximately 1.2 
X 10^ independent transfomiants 1200 colonies grew after 3 days of Incubation at 30°C. 
The colonies larger than 2 mm were streaked on histidine-lacking medium supplemented 
with 10 mM 3-amino-1 ,2.4-triazole (Sigma). Two-hundred-fifty colonies capable of growing 
under these conditions were tested for |J-galactosidase activity as described (Breedon and 
Nasmyth 1995, Cold Spring Harbor Symp. Quant. Biol. 50, p643-650). and 153 turned out 
to be His* and LacZ*. Plasmid DNA was prepared from the positive clones and 
sequenced. 



Example 3: Cell cycle interacting proteins associating with Cdc2aAt or Cdc2bAt 

Nine cDNA clones were obtained by the method described in Example 1 and 2, which 
are further described below. The specificity of the interaction those clones was verified 
by the retransfonr.atlon of yeast with pGSTCDC2A or pGBTCDC2B and the 
corresponding cDNA dones. As controls, pGBTCDC2A or pGBTCDC2B was 
cotransformed with a vector containing only the GAL4 activation domain (pGA[>424); 
and the nine cDNA vectors were each cotransfomied with a plasmid containing only the 
GAL4 DNA binding domain (pGBT9). Transformants were plated on medium with or 
without histidine. Only transfomiants containing both pGBTCDC2A or pGBTCDC2B and 
one of the nine cDNA clones were able to grow in the absence of histidine. 

1. Vb89 (SEQ ID NO: 7) 



A BLAST data base search revealed that the Vb89 clone encode the Arabidopsis 



thaliana HAL3 homologue, isolated recently and of which the function was unknown. 
Unexpectingly, the Vb89 clone interacts with CDC2bAt, but not with CDC2aAt in the 
two-hybrid system. The interaction of Vb89 with CDC2bAt highlights an important role of 
Vb 89 in cell cycle control. The publicly available databases were screened with the 
cDNA VB89. An overall perfect homology with HAL3, already known in the databases 
was found. With the help of BLASTX U80192 (score 1.9e-106) was found as the best 
homologue. This sequence is a partial cDNA from A.thaliana (entered in the databank: 
28-APR-1997)(with publ.:Culianez-Macia,F.A., Espinosa-RuizA and Sen-ano.R, 
Arabldopsis thaliana HAL3 homolog gene, Unpublished). Except that VB89 is longer, 
there are no major differences with this cDNA. 

HAL3 is a halotolerant gene isolated in Saccharomyces cerevisiae (Ferrando, 1995). 
Hal3p can inhibit the Ppz1 protein phosphatase resulting in an increased resistance to 
sodium and lithium. These effect is largely a result of the increased expression of the 
ENA/PMR2A gene. This gene codes for a P-type ATPase responsible for sodium efflux 
(De Nadal et al., 1998), The HAL3 gene has also been isolated independently (as SIS2) 
and characterized on the basis of its ability to increase, when present in high copy 
number, the growth rate of sitA mutants (Di Como et al, 1995). The SIT4/PPH1 gene 
encodes a type 2A-related Ser/Thr protein phosphatase that is required in late G1 for 
nonmal G1 cyclin expression and for bud formation. Interestingly, overexpression of 
HAL3/SIS2 stimulates the rate of cyclin accumulation in sit4 mutants. 

De Nadal E„ Clotet J„ Posas F., Senrano R.. gomez N., Anno J. (1998). The yeast 
halotolerant determinant HaISp is an inhibitory subunit of the Ppz1p Ser/Thr protein 
phosphatase. Proc. Natl. Acad. Sci. USA, 95: 7357-7362. 

Di Como C.J., Bose R., Amdt K.T. (1995). Overexpression of SIS2, Which contains an 
extremely acidic region, increases the expression of SWI4, CLN1 and CLN2 in Sit4 
mutants. Genetics, 139: 95-107. 

Ferrando A., Kron S.J., Rios G,, Fink G.R., Senrano R. (1995). Regulation of cation 
transport in Saccharomyces cerevisiae by the salt tolerance gene HAL3. Molecular and 
cellular Biology, 1 5:5470-5481 . 



2, VbDAHP (SEQ ID NO: 9) .^ 



When a BLAST data base was used rt was found that the VbDAHP clone encode a 3- 
deoxy-D-arab/no-heptulosonate 7-phosphate (DAHP) synthase with a high similarity to 
the DHS2 gene. The VbDAHP clone interacts with CDC2bAt. but not with CDC2aAt in 
the two-hybrid system. The publicly available databases were screened with the cDNA 
VBDAHP (SEQ ID NO: 9). An overall perfect homology was found with DAHP 
{AROG_ARATH 3-deoxy-D-arabino-heptulosonate 7-phosphate synthase [Arabidopsis 
thaliana]), already known in the databases. With the BLASTX as best homologue 
Q00218 (score 1.9e-49, C-term; 5.6e-86, N-term) was found. This sequence Is a 
complete mRNA from A.thaliana (entered in the databank: 01-NOV-1997) (with 
publ.:Keith, Proc. NatL^Acad. ^ ScL U.S.A. 88 (19), J821:^8825^r^(1991)). With the 
BLASTN/nr we found4he same DAHP, 

In Ara/)/dops/s f/ja//ana, . two genes has been isolated encoding 3-deoxy-D-arab//70- 
heptulosonate 7-phosphate (DAHP) synthase, an enzyme catalyzing the first committed 
step in aromatic amino acid biosynthesis (Keith et al., 1991). Both genes, DHS1 and 
DHS2, may have distinct physiological roles, as there are differentially expressed in 
plants subjected either to physical wounding or to infiltration by virulent and avirulent 
strains of Pseudomonas syringae. Other enzymes in the Arabidopsis aromatic pathway 
are also encoded by duplicated genes, an arrangement that may allow independent 
regulation of aromatic amino acid biosynthesis by distinct physiological requirements 
such as protein synthesis and secondary metabolism. 

Keith B., Dong X., Ausubei P.M.. Fink G,R. (1991) Differential induction of 3-deoxy-D- 
arafe/no-heptulosonate 7-phosphate synthase genes in Arabidopsis thaliana by 
wounding and pathogenic attack, Proc. Natl. Acad. Sci. USA, 88: 8821-8825. 



3. VbHSF (SEQ ID NO: 13) 



A BLAST data base search revealed that the VbHSF clone is very similar to the 
Arabidopsis thaliana Heai-Shock Transcription Factor HSF3. The VbHSF clone interacts 
with CDC2bAt, but not with CDC2aAt with the two-hybrid system. Organisms synthesize 
heat shock proteins (HSPs) In response to sublethal heat stress and concomitantly 
acquire increased tolerance against a subsequent, othenA^ise lethal, heat shock. Heat 
shock factor (HSF) is essential for the transcription of many HSP genes. Recently two 
HSF genes, HSF3 and HSF4, were Isolated from an Arabidopsis cDNA library (Prandl et 
aL, 1998). Transgenic Arabidopsis plants were generated containing constructs that 
allow expression of HSF3 and HSF4 or the respective translational beta-glucuronidase 
(GUS) fusions. Overexpression of HSF3 or HSF3-GUS, but not of HSF4 or HSF4-GUS, 
causes HSP synthesis at the non-heat-shock temperature of 25 degrees C in transgenic 
Arabidopsis. In transgenic plants bearing HSF3/HSF3-GUS, transcription of several 
heat shock genes is derepressed. Electrophoretic mobility shift assays suggest that 
derepression of the heat shock response is mediated by HSF3/HSF3-GUS functioning 
as transcription factor, HSF3/HSF3-GUS-overexpressing Arabidopsis plants show an 
increase in basal thermololerance, indicating the importance of HSFs and HSF- 
regulated genes as determinants of thermoprotective processes. Plants transgenic for 
HSF3/HSF3-GUS exhibit no other obvious phenotypic alterations. 
Derepression of HSF activity upon overexpression suggests the titration of a negative 
regulator of HSF3 or an intrinsic constitutive activity of HSF3. Stable overexpression of 
HSFs may be applied to other organisms as a means of derepressing the heat shock 
response. 

A possible regulatory interaction between heat shock response and cell cycle control in 
plants has already been suggested. Reindl et al. (1997) reported the phosphorylation of 
the Arabidopsis heat-shock transcription factor HSF1 by a cyclin-dependent kinase. The 
HSF1 kinase forms a stable complex with AtHSFI. The HSF1 kinase interacts with the 
cell-cycle control protein Sudp and is immunoprecipitated by an antibody specific for 



the Arabidopsis cyclin-dependent CDC2a kinase. Phosphorylation by CDC2a in vitro 
inhibits DNA binding of AtHSFI to the cognate heat-shock elements. 

Different studies have shown that Heat shock factors can serve as auxiliary proteins in 
formation of CDK/cyclin complexes. For example during meiosis I of mouse 
spermatocytes it is proposed that HSP70-2 assists in CDC2/cylinBi1 complex formation 
through interaction with CDC2 and that this interaction establishes and/or maintains the 
CDC2 protein in a conformation that is competent for cyclin B1 binding (Zhu et al, 1997), 

Prandl R, Hinderhofer K, Eggers-Schumacher G, Schoffi F. (1998). HSF3, a new heat 
shock factor from Arabidopsis thaliana, derepresses the heat shock response and 
confers thermotolerance when overexpressed in transgenic plants. Mol Gen Genet 1998 
May;258(3):269-78. 

Reindl A, Schoffi F, Schell. J, Koncz C, BSko L (1997): Phosphorylation by a cyclin- 
dependent kinase modulates DNA binding jgf^the Arabidopsis heat-^shock transcription 
factor HSF1 in vitro: Plants Physiol 1 99^Sep; 11 5(1 ):93^00i^ 

Zhu D., Dix DJ.. Eddy E.M. (1997). HSP70-2 is required for GDC2 kinase activity in 
meiosis I of mouse spermatocytes. Development 124: 3007-3014. 

4. VbDBP(SEQ tP NO: 11) 

When a BLAST data base was used it was found that the VbDBP clone is very similar to 
the putative DNA binding protein {Arabidopsis thaliana) and also contains a lot of 
homologies with PCF2 {Oryza sativa). VbDBP interacts with CDC2b but not with 
CDC2a. The publicly available databases were screened with the cDNA VBDPBP (N- 
term). With the help of BLASTX gene21 from AC003680 (score 1 .Oe-27) was found as 
best homologue. This is a genomic sequence from A.thaliana (entered in the 
databank:20-MAR-1998), chromosome II. The prediction made here gives 1 big exon, 
but the new predictions made in accordance with the present invention gave two exons 
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(the big one, followed by a small one). The cDNA VBDPBP shows not so high homology 
(gene 21 might only be from the same family as VBDPBP) with the big exon, so 
completion of the cDNA will confinm one or the other annotation and might give a new 
sequence. Other homologues are D87261) PCF2 [Oryza sativa] (score 9.2e-27) and 
D87260) PCF1 [Oryza sativa] (score 8.5e-24) both with publication: Kosugi, S. and 
Ohashi,Y. PCF1 and PCF2 specifically bind to cis elements in the rice proliferating cell 
nuclear antigen gene. Plant Cell 9 (9), 1607-1619 (1997). With the help of BLASTN/nr 
an other genomic sequence from chromosome V, AB010072 (2e-12) (08-JAN-1998) 
sequenced by the KAOS-people (P1 clone: MEE6) was found. The region with 
homology is located between (18754.. 18848) has no annotations at all. The publicly 
available databases was screened with the cDNA VBDPBP (C-term (SEQ ID NO: 15)) 
but nothing was found with BLASTX. 

PCF1 and PCF2 are proteins isolated in rice that specifically bind to sites lla and lib in 
the promotor region of the rice PCNA gene (Kosugi et al., 1997). The rice proliferating 
ceil nuclear antigen (PCNA) protein is an auxiliary protein of DNA polymerase (that 
participates in a variety of processes, such as DNA replication, DNA repair synthesis, 
and cell cycle control through reactions with the CDK-cyclin-CKI complex. The PCNA 
gene is induced at the Gl-to-S phase boundary and is well conserved in eukaryotes. 
The expression of the rice PCNA gene is restricted exclusively to meristematic regions 
and is controlled at the transcriptional phase. PCNA protein is also present in 
proliferating cells but absent from nondividing cells and terminally differentiated plant 
tissues. 

Loss-of-function analysis of the rice PCNA promoter using transgenic plants has 
demonstrated that two elements (sites lla and lib) in the proximal region are essential 
for the proliferating cell-specific transcriptional activity. On the other hand, two repeated 
site lla sequences located upstream of the cauliflower mosaic virus 35S minimal 
promoter confer transcriptional activation in tobacco protoplast. This suggests that sites 
lla and lib most probably function as positive cis-acting elements in proliferating cells. 
The proteins PCF1 and PCF2 specifically bind to sites lla and lib in the promoter region 
of the rice PCNA gene and may act as transcription factors to control DNA synthesis- 



related genes in plants. In particular, PCF2, with a high level of DNA binding activity in 
meristematic tissues, may act as transcriptional activator for these genes. These 
proteins have a deduced basic helix-loop-helix (bHLH) motif that is responsible for DNA 
binding and dimerization. PCF1 and PCF2 are novel types of bHLH proteins that are 
distinct from other known bHLH transcriptional factors. 

Kosugi, S., and Ohasi Y. (1997) PCF1 and PCF2 specifically bind to cis elements in the 
Rice proliferating cell nuclear antigen gene. The Plant Cell, 9, 1607-1619. 

5. Vb33 (SEQ ID NO: 5) 

The Vb33 done encodes a protein interacting with CDC2b but not with CDC2a. The 
publicly available databases were screened with the cDNA VB33. With the BLASTX as 
best homologue a predicted gene on the Z49937 sequence having a similarity with an 
ankyrin motif (score 0.62) was found. This sequence comes from C.elegans cosmid and 
the gene F14F3.2 was predicted based on a C.elegans EST (yk192g4.5). 

6. LDV24 (SEQ ID NO: 3) 

The LDV24 gene encodes a protein interacting with CDC2a and being highly similar to 
the PREG1 and PHO80 proteins of A/eurospora crassa and Saccharomyces cerevisiae, 
respectively. The publicly available databases were screened with the cDNA LDV24. 
With the BLASTX as best homologue the PREG(AF051226) protein from Picea mariana 
(score: 1.5e-35) and PREG(AC003672) protein from Arabidopsis (score: 3.1e-35) were 
found. But there is homology with (P20052|PH80_YEAST) PHOSPHATE SYSTEM 
CYCLIN PHO80 (score: 2.1e-10). With the BLASTN/nr we found AF051226 Picea 
mariana PREG-like protein (score: 3.9e-12). Functional domains are predicted at amino 
acid positions 61-168 anci 73-171 as comprising putative cyclin like interacting domains. 
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PHO80 itself shows similarity to the Saccharomyces cerevisiae G1 -specific cyclins 
HCS26 and OrfD (Kaffman Science 263 (1994) 1153-1155). The catalytic CDK subunit 
binding to PHO80 Is PH085, a CDK with roles in both the cell cycle and metabolic 
controls (Lenburg and O'Shea 1996. TIBS 21, p383-387). PHO80 in complex with 
PH085 regulates phosphatase gene expression. When inorganic phosphate in the 
medium is abundant the PHO80-PHO85 complex phosphorylates the PH04 
transcription factor Phosphorylated PH04 remains mainly cytoplasmic, resulting in the 
repression of expression of the PH05 phosphatase gene (O'Neill et al. 1996, Science 
271, p209-212). When cell are starved for phosphate, the PHO80-PHO85 complex is 
inhibited by the CDK inhibitor PH081, and transcription of PH05 is activated. 
The levels of PH05 expression are sensitive to the levels of PHO80- Overexpression of 
PHO80 results in a partial defect of PH05 activation when phosphate is limiting 
(Yoshida et al. 1989, MGG 217, p40"46; Madden et al. 1988. Nucleic Acids Res. 16, 
p2625-2637). At the other hand, deletion of PHO80 results in the presence of high 
levels of inorganic phosphate (Madden et al. 1988, Nucleic Acids Res. 16, p2625-2637). 
Similar effects can be expected for plants when the LDV24 genes is deleted or 
overexpressed. This might result in an adapted growth in conditions where organic 
phosphate is present at limiting or exceeding levels. More phosphate accumulation 
might positively affect the rate of plant growth and biomass production. 

7- LDV115(SEQ ID NO: 1) 

The LDV115 gene encodes a protein interacting with CDC2a but not with CDC2b and 
showing limited similarity to the Saccharomyces cerevisiae WEB1 protein. The publicly 
available databases were screended with the cDNA LDV115. With the BLASTX it was 
found as best homologue the WEB1 protein from S.pombe (AB004537)(score 6.7e-17). 
This protein as well as the other hits were mainly due to proline-richness of the LDV1 15 
translation. The homology is low but spread over about 50% of the S.pombe protein, 
which might indicate that LDV115 is at least a member of the family. The WEB1 gene 
was isolated as a yeast homologue of the adenoviral El A gene (Zieler et al., 1995, MCB 



15, p3227-3237). The protein products of the E1A gene are implicated in a variety of, 
transcriptional and cell cycle events, involving interactions with several proteins present 
in the human ceils, including parts of the transcriptional machinery and negative 
regulators of cell division such as the Rb gene product and p1 07. WEB1 is identical to 
SEC31, a protein involved in budding of transport vesicles from the endoplasmic 
reticulum (Pryer et al. 1993, J. Cell. Biol. 120, p865-875). The protein similarity between 
WEB1 and LDV115 is almost completely due to the presence of a proline-rich region 
found in both proteins. Proline-rich regions are not restricted to the WEB1 protein, but 
can also be found in many structural proteins such as hydroxyproline-rich glycoproteins 
and extensins. Therefore, LDV115 might not be a true homologue of WEB1. 

Example 4: Extension of cell cycle interacting protein encoding polynucleotides 
to full length or to recover regulatory elements ^ 

The cell cycle interacting protein encoding nucleic acid sequences (SEQ ID NOS: 1, 3, 
5, 7, 9, 11 and 13) are used to design oligonucleotide primers for extending a partial 
nucleotide sequence to full length or for obtaining 5' sequences from genomic libraries. 
One primer is synthesized to initiate extension in the antisense direction (XLR) and the 
other is synthesized to extend sequence in the sense direction (XLF). Primers allow the 
extension of the known cell cycle interacting protein encoding sequence "outward" 
generating amplicons containing new, unknown nucleotide sequence for the region of 
interest. The initial primers are designed from the cDNA using OLIGO® 4.06 Primer 
Analysis Software (National Biosciences), or another appropriate program, to be 
preferably 22-30 nucleotides in length, to have a GC content of preferably 50% or more, 
and to anneal to the target sequence at temperatures preferably about 68*'-72°C. Any 
stretch of nucleotides which would result in hairpin structures and primer-primer 
dimerizations is avoided. The original, selected cDNA libraries, prepared from mRNA 
isolated from actively dividing cells or a plant genomic library are used to extend the 
sequence; the latter is most useful to obtain 5' upstream regions. If more extension is 
necessary or desired, additional sets of primers are designed to further xtend the 
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known region. By following the instructions for the XL-PCR kit (Perkin Elmer) and 
thoroughly mixing the enzyme and reaction mix, high fidelity amplification is obtained. 
Beginning with 40 pmol of each primer and the recommended concentrations of all other 
components of the kit. PGR is perfonned suing the Peltier Thermal Cycle (PTC200; MJ 
Research, Watertown MA) and the following parameters: 



Stepi 


94°C for 1 min (initial denaturation) 


Step 2 


65°C for 1 min 


Step 3 


eS^C for 6 min 


Step 4 


94° for 15 sec 


Step 5 


eS'C for 1 min 


Step 6 


68°C for 7 min 


Step 7 


Repeat steps 4-6 for 15 additional cycles 


Steps 


94°Cfor 15sec 


Step 9 


eS^C for 1 min 


Step 10 


68X for 7: 16 min 


Step 1 1 


Repeat step 8-10 for 12 cycles 


Step 12 


72''C for 8 min 


Step 13 


4''C (and holding) 



A 5-10 Ml aliquot of the reaction mixture is analyzed by electrophoresis on a low 
concentration (about 0.6-0.8%) agarose mini-gel to determine which reactions were 
sudd^^fur in extendlhg products 
were selected and cut out of the gel. Further purification involves using a commercial 
gel extraction method such as QIAQuick™ (QIAGEN Inc). After recovery of the DNA, 
Klenow enzyme was used to trim single-stranded, nucleotide overhangs creating blunt 
ends which facilitate religation and cloning. After ethanol precipitation, the products are 
redissolved in 13 Ml ligation buffer. Ipl T4-DNA ligase (15 units) and 1 \i\ T4 
polynucleotide kinase are added, and the mixture is incubated at room temperature for 
2-3 hours or overnight at 16°C. Competent E. coli cells (in 40 m' of appropriate media) 
are transformed with 3 ul of ligation mixture and cultured in 80 ii\ of SOC medium 
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(Sambrook, supra). After incubation for one hour at SZ'^C, the whole transformation 
mixture is plated on Luria Bertani (LB)-agar (Sambrook, supra) containing 2xCarb. The 
following day, se^rarcolbnief^r€i"fandoml^ picked from each plate sihd cultured in 150 
pi of liquid LB/2xCarb medium placed in an individual well^-^ of .an appropriate, 
commerically-avallable. sterile 96-well microtiter plate. The following day^*^5 pi of each 
overnight cultures js traii§feq^d Into a non-^terjle 96:^ell plate and aftep dilution 1:10 with 
water, 5 pi of each sample is transferred into a PGR array. For PGR ampliflcatron, 18 pi 
of concentrated PGR reaction mix (3.3x) containing 4 units of 4Tth DNA polymerase, a 
vector primer and both of the gene specific primers used for the extension reaction are 
added to each well. Amplification is performed using the following conditions: 



Step 1 94X for 60 sec 

Step 2 94°C for 20 sec 

Step 3 55**C for 30 sec 

Step 4 72'C for 90 sec 

Step 5 Repeat steps 2-4 for an additional 29 cycles m 

Steps 72"Gfor 180sec 

Step 7 4"C (and holding) 



Aliquots of the PGR reactions are run on agarose gels together with molecular weight 
and appropriate clones are selected, ligated Into plasmid and sequenced. 



.7.l?12.r?l:i 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: 

(A) NAME: CropDesign N.V, 

(B) STREET: Technologiepark 3 

(C) CITY: Zwijnaarde-Gent 

(D) STATE: none 

(E) COtJNTRY: Belgium 

(F) POSTAL CODE (ZIP) : 9052 

(ii) TITLE OF INVENTION: Novel cell cycle genes and uses 

thereof 

(iii) NUMBER OF SEQUENCES: 20 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 (EPO) 



(2) INFORMATION FOR SEQ ID NO : 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19B9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 

(ii) MOLEOJLE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 2 . . 1672 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

A ACG CAA GAA ATG CAA GAA GAA GAG GAA GAA AGT TCT GAC CCA GTT 46 
Thr Gin Glu Met Gin Glu Glu Glu Glu Glu Ser Ser Asp Pro Val 
15 10 15 
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TTT GAT AAT GCC ATC CAG CGA GCG TTG ATT GTT GGA GAT TAC AAG GAG 94 
Phe Asp Asn Ala lie Gin Arg Ala Leu lie Val Gly Asp Tyr Lys Glu 
20 25 30 

GCG GTG GAT CAG TGT ATA ACT GCA AAT AAG ATG GCC GAT GCT TTA GTT 142 
Ala Val Asp Gin Cys lie Thr Ala Asn Lys Met Ala Asp Ala Leu Val 
35 40 45 

ATT GCT CAT GTT GGT GGT ACA GCG TTG TGG GAG AGT ACT CGT GAG AAA. 190 
lie Ala His Val Gly Gly Thr Ala Leu Trp Glu Ser Thr Arg Glu Lys 
50 55 60 

TAT TTG AAG ACG AAC AGT GCG CCA TAC ATG AAG GTT GTT TCT GCG ATG 238 
Tyr Leu Lys Thr Asn Ser Ala Pro Tyr Met Lys Val Val Ser Ala Met 
65 70 75 

GTG AAC AAT GAT CTC AGG AGC CTT ATC TAT ACA AGG TCA CAT AAG TTC 286 
Val Asn Asn Asp Leu Arg Ser Leu lie Tyr Thr Arg Ser His Lys Phe 
80 85 90 95 

TGG AAA GAG ACT CTT^GCT CTG CTC TGT-ACT TTT GCA CAA GGA GAA CAA 334 
Trp Lys Glu Thr Leu Ala Leu Leu Cys- Thr* -Phe Ala Gin Gly Glu Gin 
100^ > 105^ 110 

TGG ACA ACC CTG TGT GAT GCG; CTT GCG TCG AAG TTG ATG GCT GCT GGT"- 382 
Trp Thr Thr Leu Cys Asp^^-Ala Leu Ala Ser Lys- Leut^iMefe^vAla Ala GLy 
115 120 : 125 

AAC ACT TTG GCT GCA GTT CTC TGC TAC ATT TGC GCA GGG"^ AAT GCT GAC 430 
Asn Thr Leu Ala Ala Val Leu Cys Tyr lie Cys Ala Gly Asn Val Asp 
130 135 140 

AGA ACA GTA GAA ATT TGG TCG AGG AGC CTT GCA AAT GAG CGG GAT GGA 478 
Arg Thr Val Glu lie Trp Ser Arg Ser Leu Ala Asn Glu Arg Asp Gly 
145 150 155 

AGA TCT TAT GCT GAG CTT CTT CAA GAT CTT ATG GAG AAG ACT CTT GTC 526 
Arg Ser Tyr Ala Glu Leu Leu Gin Asp Leu Met Glu Lys Thr Leu Val 
160:^ ^ 165 _ 170 175 

CTT GCT TTG GCA ACT GGC AAC AAA AAG TTC AGC GCA TCT CTG TGT AAA 574 
Leu Ala Leu Ala Thr Gly Asn Lys Lys Phe Ser Ala Ser Leu Cys Lys 
180 185 190 

CTC TTT GAG AGT TAT GCT GAG ATA CTG GCC AGC CAA GGG CTT CTT ACA 622 
Leu Phe Glu Ser Tyr Ala Glu lie Leu Ala Ser Gin Gly Leu Leu Thr 
195 200 205 

ACG GCA ATG AAG TAC TTG AAA GTT CTG GAT TCT GGT GGC TTG TCA CCT 670 
Thr Ala Met Lys Tyr Leu Lys Val Leu Asp Ser Gly Gly Leu Ser Pro 
210 215 220 

GAA CTT TCA ATA TTA CGT GAT CGT ATT TCT CTA TCT GCA GAA CCT GAG 718 
Glu Leu Ser lie Leu Arg Asp Arg lie Ser Leu Ser Ala Glu Pro Glu 
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225 230 235 

ACT AAC ACT ACA GCT TCA GGA AAC ACT CAG CCT CAA AGC ACC ATG CCA 766 
Thr Asn Thr Thr Ala Ser Gly Asn Thr Gin Pro Gin Ser Thr Met Pro 
240 245 250 255 

TAT AAT CAG GAG CCA ACT CAG GCG CAA CCA AAC GTT CTT GCT AAC CCA 814 
Tyr Asn Gin Glu Pro Thr Gin Ala Gin Pro Asn Val Leu Ala Asn Pro 
260 265 270 

TAT GAT AAT CAG TAT CAG CAA CCG TAC ACT GAT TCT TAT TAT GTC CCT 862 
Tyr Asp Asn Gin Tyr Gin Gin Pro Tyr Thr Asp Ser Tyr Tyr Val Pro 
275 280 285 

CAA GTT TCA CAT CCA CCC ATG CAG CAA CCA ACC ATG TTT ATG CCA CAC 910 
Gin Val Ser His Pro Pro Met Gin Gin Pro Thr Met Phe Met Pro His 
290 295 300 

CAA GCT CAG CCA GCT CCG CAG CCA TCT TTT ACT CCA GCT CCT ACA AGC 958 
Gin Ala Gin Pro Ala Pro Gin Pro Ser Phe Thr Pro Ala Pro Thr Ser 
305 310 315 

AAT GCT CAG CCA TCC ATG AGA ACT ACA TTT GTT CCT TCA ACT CCC CCT 1006 
Asn Ala Gin Pro Ser Met Arg Thr Thr Phe Val Pro Ser Thr Pro Pro 
320 325 330 335 

GCA CTG AAG AAT GCA GAT CAA TAT CAG CAG CCA ACC ATG AGT TCT CAT 1054 
Ala Leu Lys Asn Ala Asp Gin Tyr Gin Gin Pro Thr Met Ser Ser His 
340 345 350 

TCA TTC ACG GGA CCA TCT AAC AAT GCA TAC CCT GTT CCC CCG GGT CCT 1102 
Ser Phe Thr Gly Pro Ser Asn Asn Ala Tyr Pro Val Pro Pro Gly Pro 
355 360 365 

GGT CAA TAT GCA CCT TCT GGC CCT TCA CAA CTT GGG CAA TAT CCT AAC 1150 
Gly Gin Tyr Ala Pro Ser Gly Pro Ser Gin Leu Gly Gin Tyr Pro Asn 
370 375 380 

CCT AAG ATG CCC CAA GTT GTT GCT CCA GCA GCT GGA CCC ATA GGA TTT 1198 
Pro Lys Met Pro Gin Val Val Ala Pro Ala Ala Gly Pro lie Gly Phe 
385 390 395 

ACG CCC ATG GCA ACT CCA GGA GTT GCT CCA AGA TCT GTG CAA CCA GCA 1246 
Thr Pro Met Ala Thr Pro Gly Val Ala Pro Arg Ser Val Gin Pro Ala 
400 405 410 415 

AGT CCT CCA ACA CAG CAG GCA GCT GCA CAG GCA GCC CCT GCG CCT GCA 12 94 

Ser Pro Pro Thr Gin Gin Ala Ala Ala Gin Ala Ala Pro Ala Pro Ala 
420 425 430 

ACT CCG CCA CCA ACT GTT CAG ACT GCA GAT ACT TCC AAC GTT CCA GCC 1342 
Thr Pro Pro Pro Thr Val Gin Thr Ala Asp Thr Ser Asn Val Pro Ala 
435 440 445 

CAC CAG AAA CCT GTG ATA GCA ACG TTG ACA AGG CTT TTC AAT GAG ACA 13 90 
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His Gin Lys Pro Val lie Ala Thr Leu Thr Arg Leu Phe Asn Glu Thr 
450 455 460 

TCG GAA GCA CTG GGA GGC GCA COT GCG AAT ACT ACT AAG AAG CGT GAG 1438 
Ser Glu Ala Leu Gly Gly Ala Arg Ala Asn Thr Thr Lys Lys Arg Glu 
465 470 475 

ATA GAA GAG AAC TCG AGA AAA TTA GGT GCT CTG TTT GTG AAA CTC AAG = 1486 
lie Glu Asp^Asn Ser Arg Lys Leu Gly Ala Leu Phe Val Lys. Leu Asn 
480 48^ 490 495 

AGC GGA GAC ATC TCC AAG AAT GCT GCG GAC AAA CTC GCA CAG CTA TGC 1534 
Ser Gly Asp lie Ser Lys Asn Ala Ala Asp Lys Leu Ala Gin Leu Cys 
500 505 510 

CAA GCT CTG GAC AAC AAT GAC TTC AGC ACA GCC CTT CAA ATA CAG GTA 1582 
Gin Ala Leu Asp Asn Asn Asp Phe Ser Thr Ala Leu Gin lie Gin Val 
515 520 525 

CTT CTG ACT ACC AGC GAA TGG GAC GAA TGC AAC TTC TGG CTG GCA ACA 1630 
Leu Leu Thr Thr Ser Glu Trp Asp Glu Cys Asn Phe Trp Leu Ala Thr 
530 535 540 

CTA AAG CGG ATG- ATG GTG AAG ^ GGC- AGG CAAs#AAT- GTGxf^GGG^TGAOSFATTTA^ * 1679 
Leu Lys Arg Metf^Met Val^, Lys^Ala,>Arg ^Gln 

545 550*^ 5551^*^ 

TTTTCTGGTT CATGGATTTT * TTTTTTTATA*AATCTTAAGG *rAG6GAGGTGT^GTAT<HAAACT 1739 

CCTTTTGCTT TCTTAATTTT ^ GGTrri'TiTA^AAACGCCGTr GGTGGTCTAA T TT TTT T TTT 1799 

TTTTTTTTTT TGTCATTTAT GAAGCTCATG*- TGCTACTTCC AGTTAC-TTTT TTGTTTAGAT 1859 

AGTATAGAGA TCATCAGATT GCTGAAACAT TTTCATGTTT TTGGATGTTA CTTTACCGGA 1919 

CAAGTTTGTT CGTCTTTCTT TCTAAAAAAA AAAAAAAAAA AAAAAAAAAA ACAAAAAAAA 1979 

AAAAAAAAAA 1989 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH'f* 556 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLEGULE^TYPE:- protein ^ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 

Thr Gin Glu Met Gin Gl x Glu Glu Glu Glu Ser Ser Asp Pro Val Phe 
1 5 10 15 

Asp Asn Ala lie Gin Arg Ala Leu lie Val Gly Asp Tyr Lys Glu Ala 



^1 HI 
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20 25 30 

Val Asp Gin Cys lie Thr Ala Asn Lys Met Ala Asp Ala Leu Val lie 
35 40 45 

Ala His Val Gly Gly Thr Ala Leu Trp Glu Ser Thr Arg Glu Lys Tyr 
50 55 60 

Leu Lys Thr Asn Ser Ala Pro Tyr Met Lys Val Val Ser Ala Met Val 
65 70 75 80 

Asn Asn Asp Leu Arg Ser Leu lie Tyr Thr Arg Ser His Lys Phe Trp 
85 90 95 

Lys Glu Thr Leu Ala Leu Leu Cys Thr Phe Ala Gin Gly Glu Gin Trp 
100 105 110 

Thr Thr Leu Cys Asp Ala Leu Ala Ser Lys Leu Met Ala Ala Gly Asn 
115 120 125 

Thr Leu Ala Ala Val Leu Cys Tyr lie Cys Ala Gly Asn Val Asp Arg 
130 135 140 

Thr Val Glu He Trp Ser Arg Ser Leu Ala Asn Glu Arg Asp Gly Arg 
145 150 155 160 

Ser Tyr Ala Glu Leu Leu Gin Asp Leu Met Glu Lys Thr Leu Val Leu 
165 170 175 

Ala Leu Ala Thr Gly Asn Lys Lys Phe Ser Ala Ser Leu Cys Lys Leu 
180 185 190 

Phe Glu Ser Tyr Ala Glu He Leu Ala Ser Gin Gly Leu Leu Thr Thr 
195 200 205 

Ala Met Lys Tyr Leu Lys Val Leu Asp Ser Gly Gly Leu Ser Pro Glu 
210 215 220 

Leu Ser He Leu Arg Asp Arg lie Ser Leu Ser Ala Glu Pro Glu Thr 
225 230 235 240 

Asn Thr Thr Ala Ser Gly Asn Thr Gin Pro Gin Ser Thr Met Pro Tyr 
245 250 255 

Asn Gin Glu Pro Thr Gin Ala Gin Pro Asn Val Leu Ala Asn Pro Tyr 
260 265 270 

Asp Asn Gin Tyr Gin Gin Pro Tyr Thr Asp Ser Tyr Tyr Val Pro Gin 
275 280 285 

Val Ser His Pro Pro Met Gin Gin Pro Thr Met Phe Met Pro His Gin 
290 295 300 

Ala Gin Pro Ala Pro Gin Pro Ser Phe Thr Pro Ala Pro Thr Ser Asn 
305 310 315 320 
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Ala Gin Pro Ser Met Arg Thr Thr Phe Val Pro Ser Thr Pro Pro Ala 
325 330 335 

Leu Lys Asn Ala Asp Gin Tyr Gin Gin Pro Thr Met Ser Ser His Ser 
340 345 350 

Phe Thr Gly Pro Ser Asn Asn Ala Tyr Pro Val Pro Pro Gly Pro Gly 
355 360 365 

Gin Tyr Ala Pro Ser Gly Pro Ser Gin Leu Gly Gin Tyr Pro Asn Pro 
370 375 380 

Lys Met Pro Gin Val Val Ala Pro Ala Ala Gly Pro He Gly Phe Thr 
385 390 395 400 

Pro Met Ala Thr Pro Gly Val Ala Pro Arg Ser Val Gin Pro Ala Ser 
405 410 415 

Pro Pro Thr Gin Gin Ala Ala Ala Gin Ala Ala Pro Ala Pro Ala Thr 
420 425 430 

Pro Pro Pro Thr Val Gin Thr Ala Asp Thr* Ser Asn-Val^^Pro Ala ,His 
435 440 - 445v ^ 

Gin Lys Pro Val He Ala-Thrf^^Leu Thr Arg Leu Phe-*Asn* Glu Thr*^Sei?- . 
450 455*^% 460**^ 

Glu Ala Leu Gly Gly Ala^ Arg Ala Asn »tThr*^Thr Lys Lys Arg Glu lie 
465 470^ 475 480 

Glu Asp Asn Ser Arg Lys Leu^Gly Ala Leu Phe Val Lys -^Leu Asn^^^jSer 
485 490 495 

Gly Asp He Ser Lys Asn Ala Ala Asp Lys Leu Ala Gin Leu Cys Gin 
500 505 510 

Ala Leu Asp Asn Asn Asp Phe Ser Thr Ala Leu Gin He Gin Val Leu 
515 520 525 

Leu Thr Thr Ser Glu frp Asp Glu Cys Asn Phe Trp Leu Ala Thr lieu 
530 535 540 

Lys Arg Met Met Val Lys Ala Arg Gin Asn Val Arg 
545 550 555 



(2) INFORMATION FOR SEQ ID NO: 3t 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



i 
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(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3.. 665 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 : 

GG GAC TCT CTC GCA ACC GAT CCA GCT TTC ATT GAT TCG GAT GTA TAG 47 
Asp Ser Leu Ala Thr Asp Pro Ala Phe lie Asp Ser Asp Val Tyr 
15 10 15 

CTC AGG TTA GGA CTT ATT ATT GAG GGC AAA CGA TTG AAA AAG CCA CCG 95 
Leu Arg Leu Gly Leu lie lie Glu Gly Lys Arg Leu Lys Lys Pro Pro 
20 25 30 

ACT GTT CTC TCA CGC CTC TCT TCT TCT CTG GAG AGA TCT CTG TTA CTC 143 
Thr Val Leu Ser Arg Leu Ser Ser Ser Leu Glu Arg Ser Leu Leu Leu 
35 40 45 

AAT CAT GAT GAC AAG ATT CTG CTT GGA TCG CCA GAC TCT GTT ACC GTG 191 
Asn His Asp Asp Lys lie Leu Leu Gly Ser Pro Asp Ser Val Thr Val 
50 55 60 

TTT GAC GGG AGA TCT CCC CCT GAG ATC AGT ATT GCA CAC TAC TTG GAT 239 
Phe Asp Gly Arg Ser Pro Pro Glu lie Ser lie Ala His Tyr Leu Asp 
65 70 75 

CGC ATT TTC AAG TAC TCT TGC TGC AGT CCC TCC TGC TTC GTC ATT GCG 287 
Arg lie Phe Lys Tyr Ser Cys Cys Ser Pro Ser Cys Phe Val lie Ala 
80 85 90 95 

CAT ATC TAC ATT GAT CAC TTT CTC CAT AAG ACC CGA GCC CTT CTC AAA 335 
His lie Tyr lie Asp His Phe Leu His Lys Thr Arg Ala Leu Leu Lys 
100 105 110 

CCC CTT T^T GTC CAC CGC CTT ATC ATT ACA ACT GTC ATG TTA GCT GCT 383 
Pro Leu Asn Val His Arg Leu lie lie Thr Thr Val Met Leu Ala Ala 
115 120 125 

AAA GTC TTC GAT GAT AGG TAT TTC AAC AAT GCA TAC TAC GCA AGA GTG 431 
Lys Val Phe Asp Asp Arg Tyr Phe Asn Asn Ala Tyr Tyr Ala Arg Val 
130 135 140 

GGA GGT GTG ACT ACG AGA GAG TTA AAC AGA TTG GAG ATG GAG TTG TTG 479 
Gly Gly Val Thr Thr Arg Glu Leu Asn Arg Leu Glu Met Glu Leu Leu 
145 150 155 

TTT ACC CTT GAC TTC AAG CTT CAG GTA GAT CCT CAG ACG TTT CAC ACA 527 
Phe Thr Leu Asp Phe Lys Leu Gin Val Asp Pro Gin Thr Phe His Thr 
160 165 170 175 
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CAC TGT TGT CAG TTA GAA AAG CAG AAC AGA GAC GGC TTC CAG ATC GAG 575 
His Cys Cys Gin Leu Glu Lys Gin Asn Arg Asp Gly Phe Gin lie Glu 
180 185 190 

TGG CCC ATA AAA GAA GCA TGC CGA GCC AAC AAA GAG ACT TGG CAG AAG 623 
Trp Pro lie Lys Glu Ala Cys Arg Ala Asn Lys Glu Thr Trp Gin Lys 
195 200 205 

AGG ACA CCC GAC TCA TTC TGC TCT CAA ACC ACA GCA CGC TGATCGGGAA 672 
Arg Thr Pro Asp Ser Phe Cys Ser Gin Thr Thr Ala Arg 
210 215 220 

GGGTAAGATA GGATTATTTT GTGTTTTAGT AGTGATGATT CTTTTGCATG ATTGATTGTT 732 

TGTGACAATT GTGTGTAGTA GAAAATCTGA AAATTTCTAC CAACTCATTC TTTAAGAAGT 792 

TGCTAAAAAA AAAAAAAAAA AA 814 

(2) INFORMATION FOR SEQ ID NO: 4: 

( i ) SEQUENGExGHARACTERISTICS : 

(A) LENGTH*:it220'^*amimo^ acids 

(B) TyPE:%amd;iio,i;;aca^i^ *r 
(D ) TOPOLQGY*^: wl inea*Ei*i* 

( i i ) MOLECULE..,TYa3E :»|s?proted?nmr 

(xi) SEQUENCE DESGRIPTiON*:»«»SEQ ID NO^it 4 : 

Asp Ser Leu Ala Thr ^Aspiii?Pro Ala -Phe lie Asp .Se^^ Leu 
1 5 10 15 

Arg Leu Gly Leu lie lie Glu Gly Lys Arg Leu Lys Lys Pro Pro Thr 
20 25 30 

Val Leu Ser Arg Leu Ser Ser Ser Leu Glu Arg Ser Leu Leu Leu Asn 
35 40 45 



His Asp Asp Lys lie Leu Leu Gly Ser Pro Asp Ser Val Thr Val Phe 
50 55 60 

Asp Gly Arg Ser Pro Pro Glu lie Ser lie Ala His Tyr Leu Asp Arg 

70 75 80 

lie Phe Lys Tyr Ser Cys Cys Ser Pro Ser Cys Phe Val lie Ala His 
85 90 95 

lie Tyr lie Asp His Phe Leu His Lys Thr Arg Ala Leu Leu Lys Pro 
100 105 110 

Leu Asn Val His Arg Leu lie lie Thr Thr Val Met Leu Ala Ala Lys 
115 120 125 



Val Phe Asp Asp Arg Tyx Phe Asn Asn Ala Tyx Tyr Ala Arg Val Gly 
130 135 140 



Gly Val Thr Thr Arg Glu Leu Asn Arg Leu Glu Met Glu Leu Leu Phe 
145 150 155 160 

Thr Leu Asp Phe Lys Leu Gin Val Asp Pro Gin Thr Phe His Thr His 
165 170 175 

Cys Cys Gin Leu Glu Lys Gin Asn Arg Asp Gly Phe Gin lie Glu Trp 
180 185 190 

Pro lie Lys Glu Ala Cys Arg Ala Asn Lys Glu Thr Trp Gin Lys Arg 
195 200 205 

Thr Pro Asp Ser Phe Cys Ser Gin Thr Thr Ala Arg 
210 215 220 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 1269 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY, linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1..1269 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

AAT TCG GCA CGA GGC CTT CTC CAG CTT CAT CCT TGC AAC AAG GTG GTA 
Asn Ser Ala Arg Gly Leu Leu Gin Leu His Pro Cys Asn Lys Val Val 
1 5 10 15 

CTC TGG GGT CTT TCT CAT CAG ATA TTT GTC GGC TGC TGC AGC TCT GTG 
Leu Trp Gly Leu Ser His Gin lie Phe Val Gly Cys Cys Ser Ser Val 
20 25 30 

ATG GAA GAT GAT GCT ACC AGC AAA TTA GCT GCC CCC AAG CCC GAG CCT 
Met Glu Asp Asp Ala Thr Ser Lys Leu Ala Ala Pro Lys Pro Glu Pro 
35 40 45 

GCT GAT CAG AAT CTC GAA GCT GGC AAA GCT GCT GTC TTC CAA AGG GGA 
Ala Asp Gin Asn Leu Glu Ala Gly Lys Ala Ala Val Phe Gin Arg Gly 
50 55 60 
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TAG AAT TTG GTT GAG GGG AAG TCA GAA CAT GGA TTA CCA TTG GTT GAT 
Tyr Asn Leu Val Gin Gly Lys Ser Glu His Gly Leu Pro Leu Val Asp 
65 70 75 80 
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AAT TGC AAA GAT TTG TCC TTA GCA GCT GGT AAC AAT TTC GAT GGA ACG 
Asn Cys Lys Asp Leu Ser Leu Ala Ala Gly Asn Asn Phe Asp Gly Thr 
85 90 95 
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GCT CCT TTG GAG TAT CAT CAG CAG TAT GAT CTG CAA CAA GAG TTT GAA 
Ala Pro Leu jGlu Tyr His Gin Gin Tyr Asp Leu Gin Gin Glu Phe Glu 
100 105 110 
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CCA AAC TTC AAT GGT GGT TTC AAC AAT TGT CCC AGT TAT GGT GTA GTA 
Pro Asn Phe Asn Gly Gly Phe Asn Asn Cys Pro Ser Tyr Gly Val Val 
115 120 125 



3 84 



GAG GGT CCT ATA CAT ATC TCT AAT TTT ATC CCG ACT ATT TGT CCT CAC 
Glu Gly Pro lie His lie Ser Asn Phe lie Pro Thr lie Cys Pro His 
130 135 140 



432 



CCT CTG CAT TCT TGG GTC CAA AAA TGT GCT CTT TGG GAT TGC CCT AGC 
Pro Leu His Ser Trp Val% Glii Lys Cys Ala Leu Trp Asp Cys Pro Ser 
145 150*^ 155%^ 160 



480 



CAG CTC AGG GAT TTG ATT GGG TCC AGG ATT^^ACT GCA ?GCA GCT >TCGf^ ACG 
Gin Leu Arg Asp Leu Ile> Glvy: Ser Arg lie yThr^^Ala Ala Ala Setr Thi^^ 
165 170^1 175^ 



528 



CTG CAC TGG CTT TCA AAT GAA * AGG GCC ACC AGG TAT -GAA TCC GGT GTG^ 
Leu His Trp Leu Ser Asn- Glu Arg Ala Thr'^Arg Tyr Glu Ser Gly.^Val 
180- 185"r' 190 



576 



CGT CCT GGA AGT ATC GGC CTA AAA GAC GGT CTG CTT TTT GCT GCT CTT 
Arg Pro Gly Ser lie Glv Leu Lys Asp Gly Leu Leu Phe Ala Ala Leu 
195 200 205 



624 



AGT GCA AAG GCT GGA GGG AAA GAT GTT GGT ATT CCC GAA TQT GAA GGA 
Ser Ala Lys Ala Gly Gly Lys Asp Val Gly lie Pro Glu Cys Glu Gly 
210 215 220 



GCT GCA ACT GCT AAA TCT CCA TGG AAT GCT CCA GAG CTC TTT GAT CTC 
Ala Ala Thr Ala Lys Ser Pro Trp Asn Ala Pro Glu Leu Phe Asp Leu 
225 230 235 240 



720 



ACG GTT CTG GAG AGT GAG ACA CTA AGG GAG TGG CTA TTC TTT GAC AAG 
Thr Val Leu Glu Ser Glu Thr Leu Arg Glu Trp Leu Phe Phe Asp Lys 
245 250 255 



768 



CCA AGG AGG GCC TTT GAG AGC GGG AAC AGA AAG CAA AGA TCT TTA CCA 
Pro Arg Arg Ala Phe Glu Ser Gly Asn Arg Lys Gin Arg Ser Leu Pro 
260 265 270 



816 



GAC TAC AAT GGT CGT GGT TGG CAC GAG TCA CGT AAA CAG ATC ATG GTC 
Asp Tyr Asn Gly Arg Gly Trp His Glu Ser Arg Lys Gin lie Met Val 
275 280 285 



864 



Ilml 



85 



GAG TTT GGA GGG CTG AAG AGA TCT TAG TAG ATG GAT CCA CAG CCT CTG 912 
Glu Phe Gly Gly Leu Lys Arg Ser Tyr Tyr Met Asp Pro Gin Pro Leu 
290 295 300 

CAC CAT TTC GMl TGG CAT CTT TAG GAA TAT GAG ATC AAC AAG TGT GAT 960 
His His Phe Glu Trp His Leu Tyr Glu Tyr Glu lie Asn Lys Cys Asp 
305 310 315 320 

GCT TGT GCC TTG TAG AGG CTC GAG CTC AAG CTT GTT GAC GGG AAG AAG 1008 
Ala Cys Ala Leu Tyr Arg Leu Glu Leu Lys Leu Val Asp Gly Lys Lys 
325 330 335 

ACT TCA AAA GGC AAA GTC TCA AAC GAC TCA GTG GCT GAT CTG CAG AAG 1056 
Thr Ser Lys Gly Lys Val Ser Asn Asp Ser Val Ala Asp Leu Gin Lys 
340 345 350 

CAG ATG GGA AGA CTC ACA GCT GAG TTC CCT CCA GAA AAC AAT ACC ACT 1104 
Gin Met Gly Arg Leu Thr Ala Glu Phe Pro Pro Glu Asn Asn Thr Thr 
355 360 365 

AAC ACC ACC AAC AAC TVAC AAA CGC TGC ATC AAA GGA AGA CCA AAA GTG 1152 
Asn Thr Thr Asn Asn Asn Lys Arg Cys lie Lys Gly Arg Pro Lys Val 
370 375 380 

AGC ACA AAA GTC GCC ACC GGG AAT GTT CAG AAC ACA GTA GAG CAG GCA 1200 
Ser Thr Lys Val Ala Thr Gly Asn Val Gin Asn Thr Val Glu Gin Ala 
385 390 395 400 

AAT GAC TAT GGA GTA GGT GAA GAG TTT AAC TAT CTG GTC GGA AAT CTA 1248 
Asn Asp Tyr Gly Val Gly Glu Glu Phe Asn Tyr Leu Val Gly Asn Leu 
405 410 415 

AGC GAC TAT TAT ATC CCC TG 1269 
Ser Asp Tyr Tyr lie Pro 
420 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 amino acids 

(B) TYPE: ammo acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Asn Ser Ala Arg Gly Leu Leu Gin Leu His Pro Cys Asn Lys Val Val 
1 5 10 15 

Leu Trp Gly Leu Ser His Gin lie Phe Val Gly Cys Cys Ser Ser Val 
20 25 30 



86 



Met Glu Asp Asp Ala Thr Ser Lys Leu Ala Ala Pro Lys Pro Glu Pro 
35 40 45 

Ala Asp Gin Asn Leu Glu Ala Gly Lys Ala Ala Val Phe Gin Arg Gly 
50 55 60 

Tyr Asn Leu Val Gin Gly Lys Ser Glu His Gly Leu Pro Leu Val Asp v 
65 70 75 80 

Asn Cys Lys Asp Leu Ser. Leu Ala Ala Gly Asn Asn Phe Asp Gly Thr^ 
85 90 95 

Ala Pro Leu Glu Tyr His Gin Gin Tyr Asp Leu Gin Gin Glu Phe Glu 
100 105 110 

Pro Asn Phe Asn Gly Gly Phe Asn Asn Cys Pro Ser Tyr Gly Val Val 
115 120 125 

Glu Gly Pro lie His lie Ser Asn Phe lie Pro Thr lie Cys Pro His 
130 135 140 

Pro Leu His Ser Trp Val Gin Lys Cys Ala Leu Trp Asp Cys Pro Ser 
145 ISO 155 160 

Gin Leu Arg Asp Leu lie Gly Ser Arg lie Thr Ala Ala Ala Ser Thr 
165 170 175 

Leu His Trp Leu Ser As-i Glu Arg Ala Thr Arg Tyr Glu Ser Gly -Val^ 
180 185 * 190 

Arg Pro Gly Ser lie Gly Leu Lys Asp Gly Leu Leu Phe Ala Ala Leu 

195 200-^, 205 

Ser Ala Lys Ala Gly Gly Lys Asp Val Gly He Pro Glu Cys Glu Gly 
210 215 220 

Ala Ala Thr Ala Lys Ser Pro Trp Asn Ala Pro Glu Leu Fhe Asp Leu 
225 230 235 240 

Thr Val Leu Glu Ser Glu Thr Leu Arg Glu Trp Leu Phe Phe Asp Lys 
245 ' 250 ' 255 

Pro Arg Arg Ala Phe Glu Ser Gly Asn Arg Lys Gin Arg Ser Leu Pro 
260 265 270 

Asp Tyr Asn Gly Arg Gly Trp His Glu Ser Arg Lys Gin He Met Val 
275 280 285 

Glu Phe Gly Gly Leu Lys Arg Ser Tyr Tyr Met Asp Pro Gin Pro Leu 
290 295 300 

His His Phe Glu Trp His Leu Tyr Glu Tyr Glu He Asn Lys Cys Asp 
305 310 315 320 

Ala Cys Ala Leu Tyr Arg Leu Glu Leu Lys Leu Val Asp Gly Lys Lys 
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325 330 335 

Thr Ser Lys Gly Lys Val Ser Asn Asp Ser Val Ala Asp Leu Gin Lys 
340 345 350 

Gin Met Gly Arg Leu Thr Ala Glu Phe Pro Pro Glu Asn Asn Thr Thr 
355 360 365 

Asn Thr Thr Asn Asn Asn Lys Arg Cys lie Lys Gly Arg Pro Lys Val 
370 375 380 

Ser Thr Lys Val Ala Thr Gly Asn Val Gin Asn Thr Val Glu Gin Ala 
385 390 395 400 

Asn Asp Tyr Gly Val Gly Glu Glu Phe Asn Tyr Leu Val Gly Asn Leu 
405 410 415 

Ser Asp Tyr Tyr lie Pro 
420 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 654 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..654 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GAA TTC GGC ACG AGC TCC TTC CTC GGC TGT AAC AAG ATA GAG AAG AAG 48 
Glu Phe Gly Thr Ser Ser Phe Leu Gly Cys Asn Lys He Glu Lys Lys 
15 10 15 

ATG AAT ATG GAA GTG GAT ACA GTA ACA AGG AAG CCT CGT ATC TTA CTA 96 
Met Asn Met Glu Val Asp Thr Val Thr Arg Lys Pro Arg He Leu Leu 
20 25 30 

GCT GCA AGT GGA AGT GTG GCT TCA ATT AAG TTC AGT AAT CTC TGC CAT 144 
Ala Ala Ser Gly Ser Val Ala Ser He Lys Phe Ser Asn Leu Cys His 
35 40 45 

TGT TTC TCA GAA TGG GCT GAA GTC AAA GCC GTC GCT TCA AAA TCA TCT 192 
Cys Phe Ser Glu Trp Ala Glu Val Lys Ala Val Ala Ser Lys Ser Ser 



^2-1 9961 



88 



f 



50 55 60 

CTC AAT TTC GTT GAT TWUV CCT TCT CTA CCT CAG AAT GTG ACT CTC TAT 240 
Leu Asn Phe Val Asp Lys Pro Ser Leu Pro Gin Asn Val Thr Leu Tyr 
65 70 75 80 

ACA GAT GAA GAT GAA TGG TCT AGC TGG AAC AAG ATT GGT GAT CCC GTT 288 
Thr Asp Glu Asp Glu Trp Ser Ser Trp Asn Lys lie Gly Asp Pro Val^- 
85 90 95- 

CTT CAT ATC GAG CTC AGA CGC TGG GCT GAT GTT ATG ATC ATT GCT CCT 336 
Leu His lie Glu Leu Arg Arg Trp Ala Asp Val Met He He Ala Pro 
100 105 110 

TTG TCT GCT AAC ACA TTA GCC AAG ATT GCT GGT GGG TTA TGT GAT AAT 384 
Leu Ser Ala Asn Thr Leu Ala Lys lie Ala Gly Gly Leu Cys Asp Asn 
115 120 125 

CTA TTG ACA TGT ATA GTA AGA GCA TGG GAT TAT AGC AAA CCG TTG TTT 432 
Leu Leu Thr Cys lie Val Arg Ala Trp Asp Tyr Ser Lys Pro Leu Phe 
130 135 140 

GTT GCA CCG-GCG ATG 7VAG. ACT TTG^^TG^^^ 480 
Val Ala Pro^-Ala Mefe^AsnrThi^^^Leu^Me^^^^^ 

145 15;0i% 155111*,^ 16.o# 

CGG CAC CTT GTC TTG^CTT^GAl^GAA^ CTT^GGAliJyrG^^^ 528 
Arg His Leu* Val Leu Leu^^Aspi^plu Leu**Gly lie Thir«*Leu?s^Ile ^Prb*^Pro?<- 
165 ^ 170''^^ 175*^ 

ATC AAG AAG^AAA CTGt^GCG^TGT^GGi^ 576 
He Lys Lys ^ Lys Leu**Ala'^Gyi5^Gly^Asp* Tyi^Gl^^^ 

180 185 190 

GAG CCT TCT CTG ATT TAT TCC ACT GTT AGA CTG TTC TGG GAG TCA CAA 624 
Glu Pro Ser Leu He Tyr Ser Thr Val Arg Leu Phe Trp Glu Ser Gin 
195 200 205 

GCT CGT AAA CAA AGA GAT GGA ACC AGT TG 654 

-Ala;--Arg,;:LyS::;Gln:;-J^^ - 

210 215 



(2) INFORMATION FOR SEQ ID NO :^ 8: 



(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 217 amino acids 

(B) t^ TYPE1;- amino- acid 
(D) -TOPOLOGY^ -^ linear ^ 

(ii) MOLECULE TYPE: protein 

(xi) SEQtJENCE DESCRIPTION: SEQ ID NO: 8: 



Glu Phe Gly Thr Ser Ser Phe Leu Gly Cys Asn Lys He Glu Lys Lys 




89 



1 5 10 15 

Met Asn Met Glu Val Asp Thr Val Thr Arg Lys Pro Arg lie Leu Leu 
20 25 30 

Ala Ala Ser Gly Ser Val Ala Ser He Lys Phe Ser Asn Leu Cys His 
35 40 45 

Cys Phe Ser Glu Trp Ala Glu Val Lys Ala Val Ala Ser Lys Ser Ser 
50 55 60 



Leu Asn Phe Val Asp Lys Pro Ser Leu Pro Gin Asn Val Thr Leu Tyr 
65 70 75 80 

Thr Asp Glu Asp Glu Trp Ser Ser Trp Asn Lys He Gly Asp Pro Val 
85 90 95 

Leu His He Glu Leu Arg Arg Trp Ala Asp Val Met lie He Ala Pro 
100 105 110 

Leu Ser Ala Asn Thr Leu Ala Lys He Ala Gly Gly Leu Cys Asp Asn 
115 120 125 

Leu Leu Thr Cys He Val Arg Ala Trp Asp Tyr Ser Lys Pro Leu Phe 
130 135 140 

Val Ala Pro Ala Met Asn Thr Leu Met Trp Asn Asn Pro Phe Thr Glu 
145 150 155 160 

Arg His Leu Val Leu Leu Asp Glu Leu Gly He Thr Leu He Pro Pro 
165 170 175 

He Lys Lys Lys Leu Ala Cys Gly Asp Tyr Gly Asn Gly Ala Met Ala 
180 185 190 

Glu Pro Ser Leu He Tyr Ser Thr Val Arg Leu Phe Trp Glu Ser Gin 
195 200 205 



Ala Arg Lys Gin Arg Asp Gly Thr Ser 
210 215 



(2) INFORMATION FOR SEQ ID NO : 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1856 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: CDNA 



(iii) HYPOTHETICAL: NO 
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( ix) FEATURE : 

(A) KAME/KEY: CDS 

(B) LOCATION: 63 . .1583 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GAATTCCTCG AGCTACGTCA GGGCCCTGAC GTAGCCGTCA ATCGAAATCC CAAAGATCAG 60 

CG ATG GTG ACT CTA AAC GCT TCT TCT CCT CTC ACG ACC AAG TCG TTC 107 
Met Val Thr Leu Asn Ala Ser Ser Pro Leu Thr Thr Lys Ser Phe 
1 5 10 15 

CTC CCC TAC CGT CAC GCT CCT CGC CGT CCA ATC TCT TTC TCC CCT GTC 155 
Leu Pro Tyr Arg His Ala Pro Arg Arg Pro lie Ser Phe Ser Pro Val 
20 25 30 

TTC GCC GTT CAT TCG ACT GAC CCC AAG AAA TCT ACC CAA TCA GCC TCC 203 
Phe Ala Val His Ser Thr Asp Pro Lys Lys Ser Thr Gin Ser Ala Ser 
35 40 45 

GCT TCG GTT AAA TGG AGT CTA GAG AGT TGG AAG TCG AAG AAA GCT TTG 251 
Ala Ser Val Lys Trp Ser Leu Glu Ser Trp Lys Ser Lys Lys Ala Leu 
50 55 60 

CAA TTG CCG GAT TAT CCT GAT CAG AAG GAT GTT GAT TCG GTT CTA CAG 299 
Gin Leu Pro Asp Tyr Pro Asp Gin Lys Asp Val Asp Ser Val Leu Glri 
65 70 75 

ACG CTT TCT TCT TTT CCT CCT ATA GTT TTC GCT GGT GAG GCT AGG AAA 347 
Thr Leu Ser Ser Phe Pro Pro lie Val Phe Ala Gly Glu Ala Arg Lys • 
80 85 90 95 

CTA GAG GAT AAG CTT GGT CAA GCG GCT ATG GGT CAA GCC TTT ATG CTT 3 95 

Leu Glu Asp Lys Leu Gly Gin Ala Ala Met Gly Gin Ala Phe Met Leu 
100 105 110 

CAA GGT GGT GAT TGT GCT GAG AGT TTC AAG GAA TTT AAC GCT AAT AAC 443 
Gin Gly Gly Asp Cys Ala Glu: Ser Phe Lys Glu Phe Asn Ala Asn Asn 
115 120 125 

ATT AGA GAC ACC TTT AGG GTT CTT CTT CAG ATG GGT GTT GTT CTC ATG 491 
lie Arg Asp Thr Phe Arg Val Leu Leu Gin Met Gly Val Val Leu Met 
130 135 140 

TTC GGT GGC CAG TTA CCA GTT ATC AAG GTG GGA AGA ATG GCT GGT CAG 539 
Phe Gly Gly Gin Leu Pro Val lie Lys Val Gly Arg Met Ala Gly Gin 
145 150 155 

TTT GCG AAG CCG AGA TTA GAC CCA TTT GAG GAG AAA GAT GGT GTG AAG 5 87 

Phe Ala Lys Pro Arg Leu Asp Pro Phe Glu Glu Lys Asp Gly Val Lys 
160 165 170 175 

CTG CCG AGT TAC AGA GGA GAT AAC ATA AAT GGT GAT GCT TTT GAT GAG 635 



III iHi ^^mammmmmmammmmm^m^^mm 
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Leu Pro Ser Tyr Arg Gly Asp Asn lie Asn Gly Asp Ala Phe Asp Glu 
180 185 190 

AAA TCG AGG ATT CCT GAT CCT CAT AGG ATG GTT AGA GCG TAG ACA GAG 683 
Lys Ser Arg lie Pro Asp Pro His Arg Met Val Arg Ala Tyr Thr Gin 
195 200 205 

TCT GTG GCT ACG TTG AAT CTC TTG AGA GCA TTT GCT ACT GGA GGT TAT 731 
Ser Val Ala Thr Leu Asn Leu Leu Arg Ala Phe Ala Thr Gly Gly Tyr 
2X0 215 220 

GCA GCT ATG CAG AGA GTT AGC CAG TGG AAC CTT GAT TTC ACG CAA CAT 779 
Ala Ala Met Gin Arg Val Ser Gin Trp Asn Leu Asp Phe Thr Gin His 
225 230 235 

AGT GAA CAG GGT GAC AGG TAG CGT GAA TTG GCT AAT AGA GTT GAT GAG 827 
Ser Glu Gin Gly Asp Arg Tyr Arg Glu Leu Ala Asn Arg Val Asp Glu 
240 245 250 255 

GCT TTG GGA TTC ATG GGT GCA GCT GGA CTT ACT AGT GCT CAC CCG ATC 875 
Ala Leu Gly Phe Met Gly Ala Ala Gly Leu Thr Ser Ala His Pro lie 
260 265 270 

ATG ACT ACT ACT GAG TTT TGG ACA TCC CAT GAG TGT TTG TTA TTG CCT 923 
Met Thr Thr Thr Glu Phe Trp Thr Ser His Glu Cys Leu Leu Leu Pro 
275 280 285 

TAT GAG CAA GCA CTC ACA AGA GAG GAT TCA ACA TCT GGA CTT TAC TAT 971 
Tyr Glu Gin Ala Leu Thr Arg Glu Asp Ser Thr Ser Gly Leu Tyr Tyr 
290 295 300 

GAT T6C TCT GCG CAC ATG CTT TGG GTT GGA GAA CGA ACT CGC CAA CTT 1019 
Asp Cys Ser Ala His Met Leu Trp Val Gly Glu Arg Thr Arg Gin Leu 
305 310 315 

GAT GGT GCT CAT GTT GAG TTT CTG AGG GGA ATC GCT AAC CCC CTC GGA 1067 
Asp Gly Ala His Val Glu Phe Leu Arg Gly lie Ala Asn Pro Leu Gly 
320 325 330 335 

ATC AAG GTG AGT GAT AAA ATG GTC CCT AGT GAA CTG GTT AAG CTG ATA 1115 
lie Lys Val Ser Asp Lys Met Val Pro Ser Glu Leu Val Lys Leu lie 
340 345 350 

GAG ATA CTA AAC CCT CAG AAC AAG CCT GGA AGG ATT ACG GTT ATA GTG 1163 
Glu lie Leu Asn Pro Gin Asn Lys Pro Gly Arg lie Thr Val lie Val 
355 360 365 

AGA ATG GGA GCT GAG AAT ATG CGG GTC AAG CTT CCT AAT TTG ATC AGA 1211 
Arg Met Gly Ala Glu Asn Met Arg Val Lys Leu Pro Asn Leu lie Arg 
370 375 380 

GCA GTC CGT GGA GCC GGT CAG ATT GTG ACT TGG GTT AGT GAT CCA ATG 1259 
Ala Val Arg Gly Ala Gly Gin lie Val Thr Trp Val Ser Asp Pro Met 
385 390 395 



■"-^;■l-^:-;•:■:-;■^>;•;■:-:■;*;s■>;s•;s%s•;v^^so^:-:-:-x•:-;■;o:•^;•;■x^ 
.^w.^v.%v.^^^^w.v.w.^.•.^w.^v.v.^%v.^-.-.v.-^^;•;^•X•^rf•;•^ 
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CAC GGA AAC ACA ATC ATG GCT CCT GGT GGG CTA AAA ACT CGT TCT TTC 1307 
His Gly Asn Thr lie Met Ala Pro Gly Gly Leu Lys Thr Arg Ser Phe 
400 405 410 415 

GAT GCA ATC AGG GCG GAA TTG AGA GCG TTC TTC 'iSac'IgTC: ^AT GAT tiAA 1355 
Asp Ala lie Arg Ala Glu Leu Arg Ala Phe Phe Asp Val His Asp Gin 
420 425 430 

GAA GGG AGT TTC CCT GGC GGG GTT CAT TTA GAA ATG ACT GGT CAA AAC 1403 
Glu Gly Ser Phe Pro Glv Gly Val His Leu Glu Met Thr Gly Gin Asn 
435 440 445 

GTG ACT GAA TGT GTC GGA GGG TCA CGC ACC ATC ACT TAC AAC GAT CTA 1451 
val Thr Glu Cys Val Gly Gly Ser Arg Thr lie Thr Tyr Asn Asp Leu 
450 455 460 

AGC TCA CGC TAC CAC ACT CAC TGT GAC CCA AGA CTC AAC GCA TCT CAG 1499 
Ser Ser Arg Tyr His Thr His Cys Asp Pro Arg Leu Asn Ala Ser Gin 
465 470 475 

TCT CTG GAG CTT GCA TTC ATC ATT GCA GAG CGT CTG CGA AAG AGA AGG 1547 
Ser Leu Glu Leu Ala Phe He He Ala Glu Arg Leu Arg Lys Arg Arg 
480 485 490 495 

CTT GGT TCC GGG AAT CTT CCG TCA TCT ATT^?^GGA GTC-^TAGAGAAGAA 1593 
Leu Gly Ser Gly Asn Leu Pro Ser Ser^^Ile-Gly Vai*-. 

500 - n SOS-n^^ 

GAAAATACTT ATCCGAGCTA GGATGTGTGT 'GTATAGAGGG TGATGTCTAG^TTATTAAGTT 1653 

GCCAAGTTAA ATGAGCTTGT GTACTGTTAA AAGTAAGATA TTGTTGTTTT -TGTGTGTTGG 1713 

GTTATGATTT TGTCTGAAAT /JVGTGGCTGA CTTTATAACC CGTAAATCTC TACGTCACGC 1773 

TTGCAACAAA AATTCGATAT TTGATTCAAT CACAGAAAAG TCCTCCCATT AAGGTGTAAA 1833 

CCCTGACGTA GCTCGAGGAA TTC ^^^6 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 507 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQtJENCE DESCRIPTION: SEQ ID NO: 10: 

Met Val Thr Leu Asn Ala Ser Ser Pro Leu Thr Thr Lys Ser Phe Leu 
1 5 10 15 



Pro Tyr Arg His Ala Pro Arg Arg Pro He Ser Phe Ser Pro Val Phe 
20 25 30 



Ala Val His Ser Thr Asp Pro Lys Lys Ser Thr Gin Ser Ala Ser Ala 
35 40 45 

Ser Val Lys Trp Ser Leu Glu Ser Trp Lys Ser Lys Lys Ala Leu Gin 
50 55 60 

Leu Pro Asp Tyr Pro Asp Gin Lys Asp Val Asp Ser Val Leu Gin Thr 
65 70 75 80 

Leu Ser Ser Phe Pro Pro lie Val Phe Ala Gly Glu Ala Arg Lys Leu 
85 90 95 

Glu Asp Lys Leu Gly Gin Ala Ala Met Gly Gin Ala Phe Met Leu Gin 
100 105 110 

Gly Gly Asp Cys Ala Glu Ser Phe Lys Glu Phe Asn Ala Asn Asn lie 
115 120 125 

Arg Asp Thr Phe Arg Val Leu Leu Gin Met Gly Val Val Leu Met Phe 
130 135 140 

Gly Gly Gin Leu Pro Val lie Lys Val Gly Arg Met Ala Gly Gin Phe 
145 150 155 160 

Ala Lys Pro Arg Leu Asp Pro Phe Glu Glu Lys Asp Gly Val Lys Leu 
165 170 175 

Pro Ser Tyr Arg Gly Asp Asn lie Asn Gly Asp Ala Phe Asp Glu Lys 
180 185 190 

Ser Arg lie Pro Asp Pro His Arg Met Val Arg Ala Tyr Thr Gin Ser 
195 200 205 

Val Ala Thr Leu Asn Leu Leu Arg Ala Phe Ala Thr Gly Gly Tyr Ala 
210 215 220 

Ala Met Gin Arg Val Ser Gin Trp Asn Leu Asp Phe Thr Gin His Ser 
225 230 235 240 

Glu Gill Gly Asp Arg Tyr Arg Glu Leu Ala Asn Arg Val Asp Glu Ala 

'-'^^ • 245 "'^ -^ -250^"- - ' 255 " 

Leu Gly Phe Met Gly Ala Ala Gly Leu Thr Ser Ala His Pro lie Met 
260 265 270 

Thr Thr Thr Glu Phe Trp Thr Ser His Glu Cys Leu Leu Leu Pro Tyr 
275 280 285 

Glu Gin Ala Leu Thr Arg Glu Asp Ser Thr Ser Gly Leu Tyr Tyr Asp 
290 295 300 

Cys Ser Ala His Met Leu Trp Val Gly Glu Arg Thr Arg Gin Leu Asp 
305 310 315 320 

Gly Ala His Val Glu Phe Leu Arg Gly lie Ala Asn Pro Leu Gly lie 



^V-^^••^•■••••••••:•:■•■"•■;•'•*■'.^;.v.^y.;.^.;.;.^;.•.;.'.-.;. 



Prim6(^:28^0..152d0&1 
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325 



330 



335 



Lys Val Ser Asp Lys Met Val Pro Ser Glu Leu Val Lys Leu lie Glu 
340 345 350 

He Leu Asn Pro Gin Asn Lys Pro Gly Arg He Thr Val He Val Arg 
355 360 365 

Met Gly Ala Glu Asn Met Arg Val Lys Leu Pro Asn Leu He Arg Ala 
370 375 380 

Val Arg Gly Ala Gly Gin He Val Thr Trp Val Ser Asp Pro Met His 
385 390 395 400 

Gly Asn Thr He Met Ala Pro Gly Gly Leu Lys Thr Arg Ser Phe Asp 
405 410 415 

Ala He Arg Ala Glu Leu Arg Ala Phe Phe Asp Val His Asp Gin Glu 
420 425 430 

Gly Ser Phe Pro Gly Gly Val His Leu Glu Met Thr Gly Gin Asn Val 
435 440 445 ' 

Thr Glu Cys Val Gly vGly Ser Arg Thr..Ile-Thr^Tyr^^A^^ 
450 455 460^ 

Ser Arg Tyr His Thr- His. Cys . Asp, Pro Arg-Leu^^^ A^^ 

465 47,a'*^*^ 475^. 480- 

Leu Glu Leu Ala Phe. lie. He Ala^ Glu Arg«Leu*-Arg Lys-^Arg^Arg 



485- 



490m- 



495^ 



Gly Ser Gly Asn Leu Pro Ser Ser He Gly Val 
500 505 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 1081 base pairs 
_(B V, :TYPE ; - nucleij:_.acid_ 



(cT straitoiSneSs :' 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(ix) FEATURE:^ 

(A) NAME/ KEY: CDS 

(B) LOCATION : 1 . . 954 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11; 



mill III 
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GAA TTC GGC ACG AGG GAT CCC AAG AAC CTA AAT CGT CAC CAA GTA CCA 48 
Glu Phe Gly Thr Arg Asp Pro Lys Asn Leu Asn Arg His Gin Val Pro 
510 515 520 

AAT TTC TTG AAC CCA CCA CCA CCA CCG CGA AAT CAG GGT TTG GTA GAT 96 
Asn Phe Leu Asn Pro Pro Pro Pro Pro Arg Asn Gin Gly Leu Val Asp 
525 530 535 

GAT GAT GCT GCT TCT GCT GTT GTT TCC GAC GAG AAT CGC AAA CCA ACA 144 
Asp Asp Ala Ala Ser Ala Val Val Ser Asp Glu Asn Arg Lys Pro Thr 
540 545 550 555 

ACT GAG ATT AAA GAT TTC CAG ATC GTG GTC TCT GCT TCC GAC AAA GAA 192 
Thr Glu lie Lys Asp Phe Gin lie Val Val Ser Ala Ser Asp Lys Glu 
560 565 570 

CCA AAC AAG AAG AGT CAG AAT CAG AAC CAG CTT GGT CCT AAG AGA AGC 240 
Pro Asn Lys Lys Ser Gin Asn Gin Asn Gin Leu Gly Pro Lys Arg Ser 
575 580 585 

TCT AAC AAA GAC AGA CAC ACT AAA GTC GAA GGT AGA GGT CGA CGA ATT 288 
Ser Asn Lys Asp Arg His Thr Lys Val Glu Gly Arg Gly Arg Arg lie 
590 595 600 

CGG ATG CCT GCT CTT TGT GCT GCT AGG ATT TTT CAA TTG ACT AGA GAA 336 
Arg Met Pro Ala Leu Cys Ala Ala Arg lie Phe Gin Leu Thr Arg Glu 
605 610 615 

TTG GGT CAT AAA TCT GAT GGT GAA ACT ATC CAG TGG CTG CTT CAA CAA 3 84 

Leu Gly His Lys Ser Asp Gly Glu Thr lie Gin Trp Leu Leu Gin Gin 
620 625 630 635 

GCT GAG CCA TCG ATT ATT GCA GCT ACT GGT TCA GGA ACT ATA CCG GCC 432 
Ala Glu Pro Ser He He Ala Ala Thr Gly Ser Gly Thr He Pro Ala 
640 645 650 

TCT GCT TTA GCT TCT TC\ GCT GCA ACC TCT AAC CAT CAT CAA GGT GGG 4 80 

Ser Ala Leu Ala Ser Ser Ala Ala Thr Ser Asn His His Gin Gly Gly 
655 660 665 

TCT CTT ACT GCT GGT TTA ATG ATC AGT CAT GAC TTA GAT GGT GGG TCT 528 
Ser Leu Thr Ala Gly Leu Met He Ser His Asp Leu Asp Gly Gly Ser 
670 675 680 

AGT AGT AGT GGT AGA CCA TTA AAT TGG GGG ATT GGT GGC GGT GAA GGA 576 
Ser Ser Ser Gly Arg Pro Leu Asn Trp Gly He Gly Gly Gly Glu Gly 
685 690 695 

GTT TCT AGG TCA AGT TTA CCA ACT GGG TTA TGG CCA AAT GTA GCT GGG 624 
Val Ser Arg Ser Ser Leu Pro Thr Gly Leu Trp Pro Asn Val Ala Gly 
700 705 710 715 

TTT GGT TCT GGT GTG CCA ACC ACT GGT TTA ATG AGT GAA GGA GCT GGT 672 
Phe Gly Ser Gly Val Pro Thr Thr Gly Leu Met Ser Glu Gly Ala Gly 
720 725 730 




96 



TAT AGA ATT GGG TTT CCT GGT TTT GAT TTT OCT 
Tyr Arg He Gly Phe Pro Gly Phe Asp Phe Pro 
735 740 

AGT TTT GCA TCT ATT TTG GGT GGG AAT CAT AAT 
Ser Phe Ala Ser He Leu Gly Gly Asn His Asn 
750 755 

GAG TTA GGC TTG TCT CAA GAA GGG AAT GTT GGT 
Glu Leu Gly Leu Ser Gin Glu Gly Asn Val Gly 
765 770 

TCT TTT ACT CAG ATT TAT CAA CAG ATG GGT CAG 
Ser Phe Thr Gin He Tyr Gin Gin Met Gly Gin 
780 785 790 

CAA GGT AGG GTT CTT CAC CAT ATG CAT CAT AAC 
Gin Gly Arg Val Leu His His Met His His Asn 
800 805 

CAA GAG AGT GGT GAG AAA GAT GAT TCT CAA GGC 
Gin Glu Ser Gly Glu Ly.^ Asp Asp Ser Gin ^Gly 
815 820 - 



GGT GTT GGT CAT ATG 
Gly Val Gly His Met 
745 

CAG ATG CCT GGA CTT 
Gin Met Pro Gly Leu- 
760 

GTT TTG AAT CCT CAG 
Val Leu Asn Pro Gin 
775 

GCT CAG GCT CAA GCT 
Ala Gin Ala Gin Ala 
795 

CAT GAA GAA CAT CAG 
His Glu Glu His Gin 
810 

TCA GGT CGT 
Ser Gly Arg , 
825 



TAAAAGGATT GGGTTTTTTT TGTATCTTCT GGATTTGAAA 
GTGATAATAT TGTTGTAATT-'TGTAeCAeCA TGGAGAAGAA- 
AAAAAAA 



AAGGTITTTGG- CTTTTGTTTT 
r^AAAGAAAAGG TTATATAAAA 



720 

768 

816 

864 

912 

954 

1014 
1074 
1081 



(2) INFORMATION FOR SBQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 318 amino acids 

(B) TVPE: amino acid 
(D) TOPOLOGY: linear 

(ii)__ MOLECUIiE„TYP? :_.prpt^ — : , - 7 ;: : ~; : : r: : : 

(xiT'siQUENCE DESCTli^ SEQ' ID NCT: 12: 

Glu Phe Gly Thr Arg Asp Pro Lys Asn Leu Asn Arg His Gin Val Pro 
1 5 10 15 

Asn Phe Leu Asn Pro Pro Pro Pro Pro Arg Asn Gin Gly Leu Val Asp 
20 25 30 

Asp Asp Ala Ala ser Ala Val Val Ser Asp Glu Asn Arg Lys Pro Thr 
35 40 45 

Thr Glu He Lys Asp Phe Gin He Val Val Ser Ala Ser Asp Lys Glu 
50 55 60 

Pro Asn Lys Lys Ser Gin Asn Gin Asn Gin Leu Gly Pro Lys Arg Ser 



Ul lJ^ . ' 



t 
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65 70 75 80 

Ser Asn Lys Asp Arg His Thr Lys Val Glu Gly Arg Gly Arg Arg He 
85 90 95 

Arg Met Pro Ala Leu Cys Ala Ala Arg He Phe Gin Leu Thr Arg Glu 
100 105 110 

Leu Gly His Lys Ser Asp Gly Glu Thr He Gin Trp Leu Leu Gin Gin 
115 120 125 

Ala Glu Pro Ser He He Ala Ala Thr Gly Ser Gly Thr He Pro Ala 
130 135 140 

Ser Ala Leu Ala Ser Ser Ala Ala Thr Ser Asn His His Gin Gly Gly 
145 150 155 160 

Ser Leu Thr Ala Gly Leu Met He Ser His Asp Leu Asp Gly Gly Ser 
165 170 175 

Ser Ser Ser Gly Arg Pro Leu Asn Trp Gly He Gly Gly Gly Glu Gly 
180 185 190 

Val Ser Arg Ser Ser Leu Pro Thr Gly Leu Trp Pro Asn Val Ala Gly 
195 200 205 

Phe Gly Ser Gly Val Pro Thr Thr Gly Leu Met Ser Glu Gly Ala Gly 
210 215 220 

Tyr Arg He Gly Phe Pro Gly Phe Asp Phe Pro Gly Val Gly His Met 
225 231 235 240 

Ser Phe Ala Ser He Leu Gly Gly Asn His Asn Gin Met Pro Gly Leu 
245 250 255 

Glu Leu Gly Leu Ser Gin Glu Gly Asn Val Gly Val Leu Asn Pro Gin 
260 265 270 

Ser Phe Thr Gin He Tyr Gin Gin Met Gly Gin Ala Gin Ala Gin Ala 
275 280 285 

Gin Gly Arg Val Leu His His Met His His Asn His Glu Glu His Gin 
290 295 300 

Gin Glu Ser Gly Glu Lys Asp Asp Ser Gin Gly Ser Gly Arg 
305 310 315 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 777 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY, linear 




(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION:!. .774 



iliAiillilil 
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Mini 



mm. 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

AAT TCG GCA CGA GGC AAC AAA AAG AGG AGA CTT CCT GTA GAT GAG CAG 
Asn Ser Ala Arg Gly Asn Lys Lys Arg Arg Leu Pro Val Asp Glu Gin 
320 325 330 



48 



GAG AAT CGT GGT GAC AA^ GTG GCT AAT GGT CTT AAC CGC CAG ATT GTT 
Glu Asn Arg Gly Asp Asn Val Ala Asn Gly Leu Asn Arg Gin lie Val 
335 340 345 350 
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AGA TAT CAG CCG TCG ATA AAC GAA GCA GCA CAA AAT ATG CTT CGA CAG 
Arg Tyr Gin Pro Ser lie Asn Glu Ala Ala Gin Asn Me fe- Leu* Arg -Gin 
355? 360^ zesm 



144 



TTC TTA AAT ACT AGT ACpS TCArGCT^CGG^^TAT^^GAA 192 
Phe Leu Asn Thr ^Se^. Tha^^^^Seri^EPro^^ A^ Asn Asn 

3 70*^^ 3 75^ 3 8 Om 

CCT GAC AGT TTC CTA TTG^^GGT^ GAT GTTCKTCC AGT^^CTT^C : TC^^^ 240 
Pro Asp Ser Phe^eu^^Leu^^Gly Asp Va^-Pro Ser Se?i=*^*Thr4Ser Val Asp* 
385 3 90^ 395%^ 

AAT GGG AAC CCT TCA AGT AGA GTT TCT GGA GTA ACA TTG GCC GAG TTT 288 
Asn Gly Asn Pro Ser Ser Arg Val Ser Gly Val Thr Leu Ala Glu Phe 
400 405 410 



TCA CCC AAC ACA GTT CAG TCA GCA ACG AAT CAA GTA CCC GAA GCA AGT 
Ser Pro Asn Thr Val Gin Ser Ala Thr Asn Gin Val Pro Glu Ala Ser 
415 420 425 430 



336 



TTG GCT CAT CAT CCT CAA GCT GGT CTG GTT CAG CCA AAT ATA GGT CAA 
Leu Ala His His Pro Gin Ala Gly Leu Val Gin Pro Asn lie Gly Gin 
435 440 445 



384 



AGT CCG GCT CAA GGA GCA GCA CCT GCA GAC TCT TGG AGC CCT GAA TTT 
Ser Pro Ala Gin Gly Ala Ala Pro Ala Asp Ser Trp Ser Pro Glu Phe 
450 455 460 



432 



GAT TTA GTT GGA TGC GAG. ACA GAT AGT GGA GAG JTGT .TTT GAT ,GCA ATA 
Asp Leu Val Gly Cys Glu Thr Asp Ser Gly Glu Cys Phe Asp Pro lie 
465 470 475 



480 



ATG GCT GTT TTA GAT GAG TCA GAA GGC GAT GCA ATT TCT CCT GAA GGT 
Met Ala Val Leu Asp Glu Ser Glu Gly Asp Ala lie Ser Pro Glu Gly 
480 485 490 



528 



no 
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GAG GGC AAG ATG AAT GAG TTA CTG GAG GGA GTC CCT AAG CTG CCC GGA S76 
Glu Gly Lys Met Asn Glu Leu Leu Glu Gly Val Pro Lys Leu Pro Gly 
495 500 505 510 

ATC CAA GAT CCA TTC TGG GAA CAG TTC TTT TCT GTT GAA CTC CCA GCG 624 
lie Gin Asp Pro Phe Trp Glu Gin Phe Phe Ser Val Glu Leu Pro Ala 
515 520 525 

ATT GCA GAT ACA GAC GAT ATT CTA TCA GGA TCA GTG GAG AAT AAT GAG 672 
lie Ala Asp Thr Asp Asp lie Leu Ser Gly Ser Val Glu Asn Asn Asp 
530 535 540 

TTG GTA TTG GAA CAA GAA CCA AAC GAG TGG ACC CGT AAT GAA CAA CAA 720 
Leu Val Leu Glu Gin Glu Pro Asn Glu Trp Thr Arg Asn Glu Gin Gin 
545 550 555 

ATG AAG TAT CTT ACT GAA CAA ATG GGA CTG CTT TCC TCA GAA GCA CAG 768 
Met Lys Tyr Leu Thr Glu Gin Met Gly Leu Leu Ser Ser Glu Ala Gin 
560 565 570 

AGG AAA TAA 777 

Arg Lys 

575 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 258 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Asn Ser Ala Arg Gly Asn Lys Lys Arg Arg Leu Pro Val Asp Glu Gin 
1 5 10 15 

Glu Asn Arg Gly Asp Asn Val Ala Asn Gly Leu Asn Arg Gin lie Val 
20 25 30 

Arg Tyr Gin Pro Ser lie Asn Glu Ala Ala Gin Asn Met Leu Arg Gin 
35 40 45 

Phe Leu Asn Thr Ser Thr Ser Pro Arg Tyr Glu Ser Val Ser Asn Asn 
50 55 60 

Pro Asp Ser Phe Leu Leu Gly Asp Val Pro Ser Ser Thr Ser Val Asp 
65 70 75 80 

Asn Gly Asn Pro Ser Ser Arg Val Ser Gly Val Thr Leu Ala Glu Phe 
85 90 95 

Ser Pro Asn Thr Val Gin Ser Ala Thr Asn Gin Val Pro Glu Ala Ser 
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100 



100 105 110 

Leu Ala His His Pro Gin Ala Gly Leu Val Gin Pro Asn lie Gly Gin 
115 120 125 



Ser Pro Ala Gin Gly Ala Ala Pro Ala Asp Ser Trp Ser Pro Glu Phe 
130 135 140 

Asp Leu Val Gly Cys Glu Thr Asp Ser Gly Glu Cys Phe Asp Pro lie 
145 150 155 160 

Met Ala Val Leu Asp Glu Ser Glu Gly Asp Ala lie Ser Pro Glu Gly 
165 170 175 

Glu Gly Lys Met Asn Glu Leu Leu Glu Gly Val Pro Lys Leu Pro Gly 
180 185 190 

He Gin Asp Pro Phe Trp Glu Gin Phe Phe Ser Val Glu Leu Pro Ala 
195 200 205 

He Ala Asp Thr Asp Asp lie Leu Ser Gly Ser Val Glu Asn Asn Asp 
210 215 ^ 22p 

Leu Val Leu Glu -^Gln -Glu. Pro Asn^ Glu Trp Thrv.Arg Asn Glu-^Gln Gin 
225 230*'^ 235 240 

Met Lys Tyr Leu Thr Glu Gin Met. Gly Leu Leu Ser« Ser-^Glu = Ala Gin 
245 250-t 255^^ 

Arg Lys 



(2) INFORMATION FOR SEQ ID NO: 15: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic scicL 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

(iii) HYPOTHETICAL: YES- 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CGGATCCGAA TTCATGGAGA >; ACGAG 25 



(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDKDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CGGATCCGAA TTCTCAGAAC TGAGA 25 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQtTENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
AGGGATGTTT AATACCACTA C 21 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GCACAGTTGA AGTGAACTTG C 21 



:-:-:::::-:::-:-j:-:-x<:^:->:-;:>^^^ 
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(2) INFORMATION FOR SKQ ID NO: 19: 

(i) SEQUENCE eHARl\CTER«eSTICS": 

(A) LENGOT: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPGXiOGY:- linear 

(ii) MOLECULE TYPE: other nucleic acid • 

(A) DESCRIPTION: /desc = "oligonucleotide* 

(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CGAGATCTGA ATTCATGGAT CAGTA 



25 



( 2 ) INFORMATION TOR. SEQ^^IDr NO«> 2 Q< 

( i ) SEQUENCE->CHARA®I^R'I«§Ta:GS^1N^ 

(A) LENGTH*:»»%26':H*bas.ei*paKL»rsii«i^ 

(B) TYPE^:*^nuelelc acidc^ 

(C) STRANDEPNESS:i:^singa;ei-v 

(D) TOPOL©Gy#:^linea'is^r 

( ii) MOLECULE .TYPB^f?*6thei?». nuc*leic ac-id 

(A) DESCRIPTION:*^* /desc*^^= "ol^igonucleotide"^ 

(iii) HYPOTHETICAL: YES 



(xi) SEQXJENCE DESCRIPTION; SEQ ID NOt 20- 
CGAGATCTGA ATTCCTAAGG CATGCC 



26 
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Claims 

1. A DNA sequence encoding a cell cycle interacting protein or encoding an 
immunologically active and/or functional fragment of such a protein, selected 
from the group consisting of: 

(a) DNA sequences 

(aa) comprising a nucleotide sequence encoding at least the mature 
form of a protein (LDV1 1 5) comprising the amino acid sequence as 
given in SEQ ID NO: 2; 

(ab) comprising the nucleotide sequence as given in SEQ ID NO: 1; 

(ac) comprising a nucleotide sequence hybridizing with the 
complementary strand of a nucleotide sequence as defined in (aa) 
or (ab) under stringent hybridization conditions: 

(ad) comprising an nucleotide sequence encoding a protein having an 
amino acid sequence at least 60 % Identical to the amino acid 
sequence encoded by the nucleotide sequence of (aa) or (ab); 

(ae) comprising a nucleotide sequence encoding at least the domain 
binding to CDKs of the protein encoded by the nucleotide sequence 
of any one of (aa) to (ad); 

(b) DNA sequences 

(ba) comprising a nucleotide sequence encoding at least the mature 
fonm of a protein (LDV24) comprising the amino acid sequence as 
given in SEQ ID NO: 4; 

(bb) comprising the nucleotide sequence as given in SEQ ID NO: 3; 
(be) comprising a nucleotide sequence hybridizing with the 

complementary strand of a nucleotide sequence as defined in (ba) 
or (bb) under stringent hybridization conditions; 
(bd) comprising an nucleotide sequence encoding a protein having an 
amino acid sequence at least 60 % identical to the amino acid 
sequence encoded by the nucleotide sequence of (ba) or (bb); 
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(be) comprising a nucleotide sequence encoding at least the cyclin-like 
interacting domain of the protein encoded by the nucleotide 
sequence of any one of (ba) to (bd); 

(c) DNA sequences 

(ca) comprising a nucleotide sequence encoding at least the mature 
form of a protein (VB33) comprising the amino acid sequence as 
given in SEQ ID NO: 6; 

(cb) comprising the nucleotide sequence as given in SEQ ID NO: 5; 

(cc) comprising a nucleotide sequence hybridizing with the 
complementary strand of a nucleotide sequence as defined in (ca) 
or (cb) under stringent hybridization conditions; 

(cd) comprising an nucleotide sequence encoding a protein having an 
amino acid sequence at least 60 % identical to the amino acid 
sequence encoded by the nucleotide sequence of<ca) or (cb); 

(ce) comprising a nucleotide sequence encoding at^ least the domain 
binding to CDKs of the^^protein encoded by the nucleotide sequence 
of any one of (ca) to (cd); 

(d) DNA sequences 

(da) comprising a nucleotide sequence encoding at least the mature 
form of a protein (VB89) comprising the amino acid sequence as 
given in SEQ ID NO: S; 

(db) comprising the nucleotide sequence as given in SEQ ID NO: 7; 

(dc) comprising a nucleotide sequence hybridizing with the 
complementary strand of a nucleotide sequence as defined in (da) 
or (db) under stringent hybridization conditions; 

(dd) comprising an nucleotide sequence encoding a protein having an 
amino acid sequence at least 60 % identical to the amino acid 
sequence encoded by the nucleotide sequence of (da) or (db); 

(de) comprising a nucleotide sequence encoding at least the domain 
binding to CDKs of the protein encoded by the nucleotide sequence 
of any one of (da) to (dd); 




(e) DNA sequences 

(ea) comprising a nucleotide sequence encoding at least the matur 
fomn of a protein (VBDAHP) comprising the amino acid sequence as 
giver in SEQ ID NO: 10; 

(eb) comprising the nucleotide sequence as given in SEQ ID NO: 9; 

(ec) comprising a nucleotide sequence hybridizing with the 
complementary strand of a nucleotide sequence as defined in (ea) 
or (eb) under stringent hybridization conditions; 

(ed) comprising an nucleotide sequence encoding a protein having an 
amino acid sequence at least 60 % identical to the amino acid 
sequence encoded by the nucleotide sequence. of (ea) or (eb); 

(ee) comprising a nucleotide sequence encoding at least the domain 
binding to CDKs of the protein encoded by the nucleotide sequence 
of any one of (ea) to (ed); 

(f) DNA sequences 

(fa) comprising a nucleotide sequence encoding at least the mature 
form of a protein (VBDPBP) comprising the amino acid sequence as 
given in SEQ ID NO: 12; 

(fb) comprising the nucleotide sequence as given in SEQ ID NO: 11; 

(fc) comprising a nucleotide sequence hybridizing with th 
complementary strand of a nucleotide sequence as defined in (fa) or 
(fb) under stringent hybridization conditions; 

(fd) comprising an nucleotide sequence encoding a protein having an 
amino acid sequence at least 60 % identical to the amino acid 
sequence encoded by the nucleotide sequence of (fa) or (fb); 

(fe) comprising a nucleotide sequence encoding at least the domain 
binding to CDKs of the protein encoded by the nucleotide sequenc 
of any one of (fa) to (fd); 

(g) DNA sequences 
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(ga) comprising a nucleotide sequence encoding at least the mature 
fomi of a protein (VBHSF) comprising the amino acid sequence as 
given in SEQ ID NO: 14; 

(gb) comprising the nucleotide sequence as given in SEQ ID NO: 13; 

(gc) comprising a nucleotide sequence hybridizing with the 
complementary strand of a nucleotide sequence as defined in (ga) 
or (gb) under stringent hybridization conditions; 

(gd) comprising an nucleotide sequence encoding a protein having an 
amino acid sequence at least 60 % identical to the amino acid 
sequence encoded by the nucleotide sequence of (ga) or (gb); 

(ge) comprising a nucleotide sequence encoding at least the domain 
binding to CDKs of the protein encoded by the nucleotide sequence 
of any one of (ga) to (gd); 

(h) DNA sequences, the nucleotide sequence of which is degenerate as a 
result of the genetic code to a nucleotide sequence of a DNA sequence as 
defined in any one of (a) to (g): and 

(i) DNA sequences comprising a nucleotide sequence encoding a fragment of 
a protein encoded by a DNA sequence of any one of (a) to (h). wherein 
said fragment is capable of interacting with a cell cycle protein. 

A method for Identifying and obtaining cell cycle interacting proteins comprising a 
two-hybrid screening assay wherein CDC2a or CDC2b as a bait and a cDNA 
library of a plant cell suspension as prey are used. 

3. The method of claim 2. wherein said CDC2a is CDC2aAt and CDC2b Is 
CDC2bAt. 



4. 



A DNA sequence encoding a cell cycle interacting protein obtainable by the 
method of claim 2 or 3. 
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5. A nucleic acid molecule of at least 15 nucleotides in length hybridizing specifically 
with a DNA sequence of claim 1 or 4 or with a complementary strand thereof. 

6. A vector comprising a DNA sequence of claim 1 or 4. 

7. The vector of claim 6 which is an expression vector wherein the DNA sequence is 
operatively linked to one or more control sequences allowing the expression in 
prokaryotic and/or eukaryotic host cells. 

8. A host cell containing a vector of claim 6 or 7 or a DNA sequence of claim 1 or 4. 

9. The host cell of claim 8 which is a bacterial, insect, fungal, plant or animal cell. 

10. A method for the production of a cell cycle interacting protein or an 
immunologically active or functional fragment thereof comprising culturing a host 
cell of claim 8 or 9 under conditions allowing the expression of the protein and 
recovering the produced protein from the culture. 

11. A cell cycle interacting protein or an immunologically active or functional fragment 
thereof encodable by a DNA sequence of claim 1 or 4 or obtainable by the 
method of claim 2; 3 or 10. 

12. An antibody specifically recognizing the protein of claim 11 or a fragment or 
epitope thereof. 

13. A method for the production of transgenic plants, plant cells or plant tissue 
comprising the introduction of a DNA sequence of claim 1, 4 or 5 or a vector of 
claim 6 or 7 into the genome of said plant, plant cell or plant tissue. 

14. The method of claim 13 further comprising regenerating a plant from said plant 
tissue or plant cell. 
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15. A transgenic plant cell comprising a DNA sequence of claim 1 or 4 which is 
operably linked to regulatory elements allowing transcription and/or expression of 
the DNA sequence in plant cells or obtainable according to the method of claim 
13 or 14. 

16. The transgenic plant cell of claim 15 wherein said DNA sequence or said vector 
is stably integrated into the genome of the plant cell. 

17. A transgenic plant or a plant tissue comprising plant cells of claim 1 5 or 1 6. 

18. The transgenic plant of claim 17 in which plant cell division and/or growth is 
enhanced and/or^ wherein «^the plant is less sensitive to environmental stress 
compared to*the'GorTespi9.mjdingjj^ildiityge«plantf*4 

19. A transg^nie«plant»;Geil..which Gontainssstablysintegriatedfwintotthe'vgenome a DNA 
sequence of Glaimwl, 4 orS or part-thereofcor obtainable according to the method 
of claim 13- or*^14'?^wherein 4he»atranseripti.on»^and/0r' expression of the DNA 
sequence or part thereof leads to reduction of the synthesis of a cell cycle 
interacting protein in the cells. 

20. The plant cell of claim 19. wherein the reduction is achieved by an antisense. 
sense, ribozyme, co-suppression, dominant mutant effect and/or a knock out 
mutatn in the gene. 

21 . A transgenic plant or plant tissue comprising plant cells of claim 19 or 20. 

22. The transgenic plant of claim 21 which displays a deficiency in plant cell division 
and/or growth. 




23. Harvestable parts or propagation material of plants of any one of claims 17, 18, 
21 or 22 comprising plant cells of claim 15, 16, 19 or 20. 

24. A regulatory sequence of a promoter regulating the expression of a nucleic acid 
molecule comprising the DNA sequence of any one of claim 1 or 4, said 
regulatory sequence being capable of conferring expression of a heterologous 
DNA sequence during various stages of the cell cycle. 

25. A recombinant DNA molecule comprising the regulatory sequence of claim 24. 

26. The recombinant DNA molecule of claim 25, wherein said regulatory sequence is 
operatively linked to a heterologous DNA sequence. 

27. A host cell transformed with a regulatory sequence of claim 24 or a recombinant 
DNA molecule of claim 25 or 26. 

28. A transgenic plant, plant tissue, or plant cell comprising the regulatory sequence 
of claim 24 or the recombinant DNA molecule of claim 25 or 26. 

29. A method for the identification of an activator or inhibitor of cell cycle interacting 
proteins or their encoding genes comprising the steps of. 

(a) culturing a plant cell or tissue or maintaining a plant comprising a 
recombinant DNA molecule comprising a readout system operatively 
linked to a regulatory sequence of claim 24 in the presence of a compound 
or a sample comprising a plurality of compounds under conditions which 
permit expression of said readout system; 

(b) identifying or verifying a sample and compound, respectively, which leads 
to suppression or activation and/or enhancement of expression of said 
readout system in said plant, plant cell, or plant tissue. 



30. A method for identifying and obtaining an activator or inhibitor of cell division 
comprising the steps of: 

(a) combining a compound to be screened with a reaction mixture containing 
the cell cycle interacting protein of claim 11 and a readout system capable 
of interacting with the protein under suitable conditions which permit 
interaction of the protein with said readout system; 

(b) Identifying or verifying a sample and compound, respectively, which leads 
to suppression or activation of the readout system. 

31 . A method of producing a therapeutic agent comprising the steps of the method of 
claim 30 and synthesizing the activator or inhibitor obtained or identified in step 
(b) or an analog or derivative thereof In an amount sufTicient to provide said agent 
in a therapeutically effective amount to a patienti * 

32. A method of producing a plant.effective agent comprising the-steps of the method 
of claim 30 and synthesizing the activator or inhibitor obtained or Identified in step 
(b) or an analog or derivative thereof in an effective. amount sufficient to provide 
said agent in an effective amount suitable for the application In agriculture or 
plant cell and tissue culture. 

33. A method of producing a therapeutic or plant effective composition comprising 
the steps of the method of claim 30 and combining the compound obtained or 
identified in step (b) or an analog or derivative thereof with a pharmaceutically 
acceptable carrier or with a plant cell and tissue culture acceptable carrier. 



34. An activator or inhibitor of a cell division obtained by the method of any 
claims 30 to 32. 




35. A composition comprising a DNA sequence of claim 1 , 4 or 5, a vector of claim 6 
or 7, a protein of claim 1 1 , an antibody of claim 12, or the activator or inhibitor of 
claim 34. 

36. The composition of claim 35 for use as a medicament, a diagnostic means, a kit 
or plant effective agent. 

37. Use of a DNA sequence of claim 1 , 4 or 5, the vector of claim 6 or 7, the protein 
of claim 11. the antibody of claim 12 or the activator or inhibitor of claim 34 for 
modulating the cell cycle in an animal or plant, plant cell division and/or growth, 
for influencing the activity of cell cycle proteins in a plant or animal cell, as 
positive or negative regulator of cell proliferation, for modifying the growth 
inhibition caused by environmental stress conditions, for use in a screening 
method for the identification of inhibitors or activators of cell cycle proteins, as 
growth regulator, herbicide or for inducing nematode resistance in plants. 

38. Use of a DNA sequence of claim 1 , 4 or 5 or the regulatory sequence of claim 24 
as a marker gene in plant or animal cell and tissue culture or as a marker in 
marker-assisted plant breeding. 

39. Use of the two-hybrid system as defined in claim 2 or 3 for the identification of 
cell cycle interacting proteins or activators or inhibitors of such poteins, 

40. Use of a regulatory sequence of claim 24 or a recombinant DNA molecule of 
claim 25 or 26, for the expression of a heterologous DNA sequence during a 
stage of the cell cycle. 




Provided are DNA sequences encoding cell cycle interacting proteins as well as 
methods for obtaining the same. Furthermore, vectors comprising said DNA sequences 
are described, wherein the DNA sequences are operatively linked to regulatory 
elements allowing expression in prokaryotic and/or eukaryotic host cells. In addition, 
proteins encoded by said DNA sequences, antibodies to said proteins and methods for 
their production are provided. Also described is a method for controlling or altering 
growth characteristics of a plant and/or a plant cell comprising introduction and/or 
expression of one or more cell cycle regulatory proteins functional in a plant or parts 
thereof and/or one or more DNA sequences encoding such proteins. Also provided are 
regulatory sequences controling the expression of the above described cell cycle 
interacting proteins. Methods for the identification of compounds being capable of 
activating or inhibiting the cell cycle are described as well. Further described are 
diagnostic compositions comprising the aforementioned DNA sequences, regulatory 
sequences, proteins, antibodies, inhibitors and activators. Furthermore, transgenic plant 
cells, plant tissue and plants containing the above-described DNA sequences and 
vectors are described as well as the use of the aforementioned DNA sequences, 
vectors, proteins, regulatory sequences, antibodies and/or compounds identified by the 
method of the invention in plant cell and tissue culture, plant breeding and/or agriculture. 
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